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An Overflow System in Which 
Queuing Takes Precedence 
By J. A. MORRISON 
(Manuscript received July 1, 1980) 


When calls offered to a primary group of trunks find all of them 
busy, provisions are often made for these calls to overflow to other 
groups of trunks. Such traffic overflow systems have been of interest 
for a long time, but recently overflow systems that allow for some 
calls to be queued have been of importance. In this paper we analyze 
a traffic overflow system with queuing, which consists of a primary 
and a secondary group. The system which we consider here differs 
from the two systems we investigated earlier, in that no overflow from 
the primary to the secondary is permitted if there is a waiting space 
available in the primary queue. As with the earlier investigations, we 
adopt an analytical approach which considerably reduces the dimen- 
stons of the problem, and simplifies the calculation of various steady- 
state quantities of interest. Our results include expressions for the 
loss probabilities, the average waiting times in the queues, and the 
average number of demands in service in each group. 


1. INTRODUCTION 


In this paper a particular overflow system with queuing is analyzed. 
The system consists of two groups, a primary and a secondary, with n; 
servers and gq; waiting spaces, which receive demands from indepen- 
dent Poisson sources S; with arrival rates A, > 0, Rk = 1 and 2, 
respectively, as depicted in Fig. 1. The service times of the demands 
are independent, and exponentially distributed with mean service rate 
p. > O. If all m2 servers in the secondary are busy when a demand from 
Ss arrives, the demand is queued if one of the q2 waiting spaces is 
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available, otherwise it is lost (blocked and cleared from the system). 
Demands waiting in the secondary queue enter service (in some 
prescribed order) as servers in the secondary become free. 

If all m; servers in the primary are busy when a demand from S, 
arrives, the demand is queued in the primary, if one of the gi waiting 
spaces is available. No overflow is permitted from the primary queue, 
so that a demand in the primary queue must wait for a server in the 
primary to become free. If all nm: servers in the primary are busy and 
all g, waiting spaces are occupied, when a demand from S; arrives, the 
demand is served in the secondary, if there is a free server and there 
are no demands waiting in the secondary queue, otherwise it is lost. 

The overflow system described above differs from the two systems 
which we investigated earlier,’” in that no overflow from the primary 
to the secondary is permitted if there is a waiting space available in 
the primary queue. This restriction was one invoked by Anderson.’ In 
the two systems investigated earlier, arriving calls can overflow when 
the primary queue is not full. The system considered in this paper is 
a particular case of the one considered by Rath,* which was composed 
of two queues, one of which is allowed to overflow to the other under 
specified conditions involving the queue lengths. He obtained some 
numerical solutions using a Gauss-Seidel iteration technique, but none 
of these correspond to the particular system that we are considering. 
He also developed an approximate procedure for analyzing his system, 
based on the use of the Interrupted Poisson Process. 

Here we analyze the overflow system using a technique analogous 
to the one introduced in the earlier paper.’ Let pi; denote the steady- 
state probability that there are i demands in the primary and / 
demands in the secondary, either in service or waiting. These proba- 
bilities satisfy a set of generalized birth-and-death equations, which 
take the form of partial difference equations connecting neighboring 
states. We carry out an analysis that reduces the dimensions of the 
problem, which may be considerable in cases of interest. The basic 
technique is to separate variables in the region away from a certain 
boundary of state space. This leads to an eigenvalue problem for the 
separation constant. The eigenvalues are roots of a polynomial equa- 
tion. The probabilities p;; are then represented in terms of the corre- 
sponding eigenfunctions. The constant coefficients in these represen- 
tations are determined from the boundary conditions and the normal- 
ization condition that the sum of the probabilities is unity. 

Various steady-state quantities are of interest, which may be ex- 
pressed in terms of the probabilities p;;. The quantities include the loss 
(or blocking) probabilities, the average waiting times in the queues, 
and the average number of demands in service in each group. These 
quantities may be expressed directly in terms of the constant coeffi- 
cients which occur in the representations for the probabilities p;;. Thus 
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the steady-state quantities of interest may be calculated directly, 
without having to calculate the probabilities p;;. Here again the reduc- 
tion in the dimensions of the problem is valuable. 

Only the theoretical results are presented in this paper. Numerical 
results will be presented in a forthcoming paper by Kaufman, Seery, 
and Morrison.’ Results will be given there for the two overflow systems 
considered previously, based on the earlier analysis,’ as well as for the 
system considered in this paper. 

Section II discusses the representation of the probabilities p,; in 
terms of the eigenfunctions, and the boundary and normalization 
conditions. Various steady-state quantities of interest are calculated in 
Section III. The appendix gives properties of the eigenfunctions. 

We assume throughout the analysis that gi = 1, since the system 
considered in this paper, and the two systems analyzed earlier, are 
identical if gq, = 0, ie., if there is no primary queue. However, the 
results of this paper reduce to those obtained earlier’ if gi = 0. If qe is 
large, or even infinite, an alternate analysis, analogous to that pre- 
sented for the other two systems,” may be carried out for the present 
system, but we do not pursue that here. 


li. REPRESENTATION AND BOUNDARY CONDITIONS 


' We let pi; denote the steady-state probability that there are 1 
demands in the primary and 7 demands in the secondary, either in 
service or waiting. These probabilities satisfy a set of generalized birth- 
and-death equations,° which may be derived in a straightforward 
manner. We define the traffic intensities 


a, = Ai/p, a2 = X2/p, (1) 


and let the total number of servers and waiting spaces in each group 
be 


kh =n + Q1, kz = no + qr. (2) 
It is convenient to introduce the function 
1, l=0, 
xI= fs 1<0, (8) 
as well as the Kronecker delta 
= i; L = I 
ou fi ix]. (4) 


Then the birth-and-death equations are 
[ai(1 — diz,Xj-n.) + ao(1 — S,) + min(Z, m1) + min(J, N2)] pi; 
= a,(1 = 6:0) Di-1,; + (1 => 550) (A180, Xn9—j ze Q2)Dij-1 
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+ (1 — 6,)min(t + 1, m1)pisi,; + (1 — 5,)min(7 + 1, n2)pij+1, (5) 


for 0 =i ski, 0 <7 S ke. The normalization condition is 


YY py=1. (6) 


For 2 # k,, the variables in (5) may be separated, and there are 
solutions of the form a,8;, where 


[a; + min(Z, m1) + pla; = ai(1 — dio)ai-1 + min(? + 1, mi)aini1, (7) 
for0<i<k, —1, and 
[a2(1 — d,) + min(J, n2) — p]B; 
= a2(1 — 80)Bj-1 + (1 — 8x,)min(j +1, n2)Bay (8) 
for 0 <7 S ko, and p is a separation constant. Hence, from (7), 
(a1 +i + p)a; = ai(1 — So)ai-i + (i + Lain, (9) 


for 0 <i =n, — 1. The solution of (9) may be expressed in terms of 
Poisson-Charlier”® polynomials. We here denote the solution of (9) for 
which ao = 1 by si(p, ai). The properties of s;(p, a) which we will need 
are given in the appendix. 

We assume that gi = 1. Then, from (7), 


(a) + m1 + p)ai = Qiai-1 + N1Ai41, (10) 


for n} =i =k; — 1. The solution of (10) may be expressed in terms of 
Chebyshev polynomials of the second kind,’ U,(x). It is convenient to 


define 
m\"_ (atntp 
aio) = (2) u(22 ute), (11) 
as a ; 2Vain, 


The appendix gives the properties of these functions that we need. We 
note here, however, that Uo(x) = 1 and U-_i(x) = 0. From (9), (10), 
(53), and (64), with a suitable normalization, it follows that 


1 


71 
(=) Si(p, a), Osis ni, 
ay 
a;(p) = h-i (12) 
(=) [Sn,(p, @1)Qi-n,(0)—Sn,-1(p, @1)Qi-n,-1(9)], 
1 
m=is ky. 
Next, from (8), 
(a2 +7 — p) By = a2(1 — So) Bj-1 + (7 + DB, (13) 
for O<jsnm- 1. It follows from (53) that B; is proportional to 


sj(—p, @2) for 0 S7 S ne. Also, 
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[a2(1 — dj.) + 2 — p]Bj = a28j-1 + no(1 — dj.) Bir, (14) 


for n2 <j = ke. Corresponding to (11), we define 


1/2 
Ne d2+ne-p 
Wisse t— | Dl (15) 
no) (=) ( 2V Aone 
We also define 
$;(p) = Vr,-(p) — Vr,-7-1(). (16) 


The appendix gives the properties of these functions that we need. It 
follows from (14) and (62) that 8; is proportional to $,(p) for nz -1< 
Js ko. 

Consequently, we take 


Bi ) = Si(—p, a2)n,(p); 0 <j = no, 
rj\ 0 Sn.(—Pp, A2)¢;(p), N2—-—1 <j = ko, (17) 


where 
Sn-1(—P, A2)hn,(P) = Sn,(—P, A2)n.-1(). (18) 
This equation may be written in the form 
pl Sn,(1 — p, A2)¥o,(p) — Sn,-1(1 — p, A2)¥y,-1()] = 0. (19) 


The expression in the square brackets in (19) is a polynomial in p of 
degree ko = nz + ge. It was shown’ that its zeros are positive and 
distinct, and we denote them by pm, m = 1, -++, Xe. We also let po = 0. 
It follows that we may represent the probabilities p;; in the form 

ko 

DY CmOi(Pm)S;(—Pm, A2)On(Pm); 0<j<n, 


m=0 
Pij = (20) 


ko 
d, CmOi(Pm)Sn.(—Pm, A2);(Pm); nSj sk, 


for 0 <1 S ky, where a;(p) is defined in (12), and the constants cn are 
to be determined. 

It remains to satisfy the boundary conditions corresponding to i = 
k,; in (5), as well as the normalization condition (6). If we set i = k; in 
(5), we obtain 


(a) + a2 + m1 + J)PR,j = G1 PR,-1,7 + (Ai + a2)(1 — 8j0)DR,,j-1 

+ (7 + 1)pr, jus, (21) 
for 0 <j <= ne - 1, 
[a2(1 — 54,0) + m + N2]Da,,ny 


= G1 Piy-1n, + (A1 + A2)Dhyng-1 + M2(1 — 84,,0)Pkyny+1, (22) 
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and, if q2 = 1, 
[a2(1 — djx,) + mi + n2)pr,,; 
= Q1Pz,-1,j7 + A2Pe,,j-1 + no(1 — Ojk)D hy, j+1s (23) 


fornga +1 Sj Sho. 

If we substitute (20) into (21), after reduction with the help of the 
recurrence relations in the appendix, we find that 
ho 


> Cm{Pm[Sn,(1 + Pm, Q;)Qg, (Pm) _ Sn,-1(1 + Pm) 1) Qg,—-1(Pm) ]S(—P my 2) 
m=0 


+ ai[Sn,(Pm; A1)Qq,(Pm) — Sn,-1(Oms A1)2Xq,-1(Pm) ]8(—1 — pm, A2)} 
*bn,(Om) = 0, (24) 
for 0 <j S n2 — 1. Also, from (23), it is found that 


ko 
d, CmPmSn.(—Pm, A2)[Sn,(1 + Pm, A1)Qg, (Om) 


= Sn,-1(1 + Pm, @1)Qq,-1(Pm) hom) = 0, (25) 


for no + 1 <7 S ke. It may be shown that the boundary condition (22) 
is redundant, as is to be expected. Thus the constants c,, are deter- 
mined by (24) and (25) only to within a multiplicative constant, which 
is determined from the normalization condition (6). 

From (20), with the help of (16), (19), (57), and (58), it is found that 


ho 
> Di; = coai(0)[Sp,(1, @2)Vo,(0) — Sn,-1(1, a2) ¥,-1(0)], (26) 
j=0 

for 0 <i < k,. But, from (12) and (66), 


ny) 


q1 
(=) si(0, a), 0<i ni, 
a 
ai(0) = (27) 


ni 


ky-t 
(=) Sn, (0, ay), n= is ky. 
1 


Hence, from (26) and (27), with the help of (57), (58), and (65), the 
normalization condition (6) implies that 


Col Sn, (1, a1) 2g, (0) = Sn,-1(1, a;)&%q,-1(0)] 
[8n,(1, @2)%q,(0) — Sny-1(1, @2)¥q,-1(0)] = 1. (28) 


Once the constants c,, have been determined, the steady-state prob- 
abilities p;; may be calculated from (20). We remark that the number 


6 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1981 


of constants to be determined is only ke + 1, whereas the number of 
probabilities p;; is (ki + 1)(Re + 1). 


lil. SOME STEADY-STATE QUANTITIES 

We proceed now to the calculation of various steady-state quantities 
of interest. These quantities are depicted in the diagram of Fig. 1, 
which indicates the mean flow rates. The loss probabilities L; and Lz 
are given by 


ky ky 
I, = y Pky js Lz = » Pikoy (29) 
JEN 1=0 


and the probabilities that a demand from the primary, or secondary, 
source is queued on arrival are 


ki-1 ko ky Ro-1 
Q= > Vp, Q=(1- 8,0) Y Y py. (30) 
i=n, j=0 =0jens 


The probability that a demand arriving from the primary source 


n, SERVERS 


Ry, =A,Q, 


a1 WAITING 
SPACES 
ny SERVERS 


Ra2 =A 2Q9 


gz WAITING 
SPACES 


Fig. 1—Mean flow rates for an overflow system with queuing; Poisson arrival rates 
A, and dg, loss probabilities LZ; and L2, queuing probabilities Q, and Q2, and overflow 
probability ho. 


Ayly 






A (1—Ly hh) 











A2(1-Ly) + Ay bye 
A 2(1-L2—Q,) ‘ 






A2 


Nagle 
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overflows (immediately) is 
No-1 


he = > Phk,,j- (31) 
j=0 


Since the mean service rate is p, the mean departure rate from the 
primary queue to the primary servers is 


ki he 
Ry = Nip y >. Dij; (32) 
i=n,+1 j=0 


while the mean departure rate from the secondary queue to the 
secondary servers is 


ky ko 
Ro = nop (1 - 5q,,0) 3 Ly Pij. (33) 
i=0 J=ny+1 


The average number of demands in the primary and secondary queues 
are 


ky hg ky ke 
Vy, = > » (u = N1)Dij; V2 = y > V7 _ N2)pij. (34) 
i=n, j=0 i=0 Jang 


Also, the average number of demands in service in the two groups are 


bi ie he 
X= YY mint, m)pij, X= YY min(j, n2)piy. (35) 
t=0 j=0 i=0 j=0 


Now, according to Little’s theorem,’ the average number of demands 
in a queuing system is equal to the average rate of arrival of demands 
to that system times the average time spent in that system. If we apply 
this result to the primary and secondary queues, we find that the 
average waiting times of the demands which are queued in the primary 
or in the secondary are given by 

Vi V2 
W,= MQ)’ W.2 = Wa) (ge = 1), (36) 
respectively, independently of the order of service within each queue. 
Also, if we apply Little’s theorem to the primary and secondary groups 
of servers, we obtain 


Ai(1 — Li — tie) = bX), A2(1 — Le) + Ache = bX. (37) 


The steady-state quantities of interest may be expressed in terms of 
the constants c,, with the help of the representations in (20). From 
(29) it is found, with the help of (12) and (16), that 

Ro 
ly = > Cm[Sn,(Oms 1) Xz, (Pm) = Sn,-1(Pm, @)Q2g,-1(Om) ] 
m=0 


*Sno(—Pm, A2)Vq,(Pm). (38) 
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We define 
do = Co[Sn,(1, A2)V_,(0) — Sn,-1(1, a@2)Vq,-1(0)] 
= [sn,(1, a1)Q¢,(0) — Sn,-1(1, a1) %q,-1(0) J’, (39) 


from (28). Then, from (30) it is found, with the help of (26), (27), and 
(65), that 


Q = dosn,(0, 1) [Qq, (0) = 1); (40) 
Moreover, from (29) and (31), it follows that 
Ly + The = dosn,(0, ai), (41) 


and from (32) it follows that Ri: = AiQ,, as is to be expected, since in 
the steady state the mean departure rate from the queue is equal to 
the mean arrival rate to it. 

We define 


ped I Ga De Pere 2) ue A, 
als) 2 amie {i q(qq+l), é=1. a2) 


Then, from (34), with the help of (26), (27), and (39), it is found that 
Vi = dosn,(0, aan (2), (43) 
1 


Also, from (35), with the help of (56), (58), (59), and (65), it follows 
that 


q 
0 Ce een (=) Sn,-1(1, a1) + Sn,(0, @1)[%q,(0) — uf. (44) 
1 


It may be verified, with the help of (39), (57), and (65), that (41) and 
(44) are consistent with (37). The explicitness of the expressions for 
the quantities in (40), (41), (48), and (44) is due to the fact that these 
quantities are not affected by the secondary. This, of course, is not the 
case for the loss probability Li, which is given by (38). 

Next, from (29), since ¢x,(p) = 1, it is found, with the help of (20) 
and (68), that 

ko 

Lz = d, CmSnz(—Pm, G2)[Sn,(1 + Pm, A1)Q2q,(Om) 


— Sn,-1(1 + pm, @1)Qq,-1(Pm)]. (45) 
Also, from (35), with the help of (6) and (59), it follows that 


he 
X2 = NnN2—- » CmSn,-1(2 — Pm, Q2) on, (Pm) 


m=0 


*[Sn,(1 + Pm, @1)Qg,(Om) — Sn—1(1 + Pm, A1)Mq,-1(Pm)}. (46) 
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In view of (38), (41), (45), and (46), the second relationship in (37) 
provides a useful numerical check. 


We now define 
ky 


r= > Piss n2 Sj] Sho. (47) 
i=0 
If we sum on 7 in (5), we obtain 
[a2(1 — Sjr,) + nelrj = aerj-1 + no(1 — Sn) rj+1, (48) 
for no +17 S kz. It follows that 
Ne; = AePj-1, ng+1sjs ho. (49) 


Hence, since Lz = rz,, from (29) and (47), 


Ro-J 
r= (=) La, N2 <j < ho. (50) 


Then, from (30) and (33), with the help of (63), we obtain 
Q2 = [W.,(0) — 1]Le, (51) 
and R22 = A2Qz2, as is to be expected. Also, from (34) and (42), it follows 
that 
Ve = a, (=) Le (52) 
a2 


This completes the calculation of expressions for the steady-state 
quantities of interest. 
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APPENDIX 


We define s;(A, a) by the recurrence relation 
(a+itA)s (A, a) = a(1 — dio)si-i(A, a) + (2 + 1)5i41(A, @); 
SoA, a) = I, (53) 


for i= 0,1, ---. Thus s,(A, a) is a polynomial of degree n in both A and 
a, and it may be related to a Poisson-Charlier polynomial.’® However, 
we give here the properties of s,(A, a) which we will need. An explicit 


formula is’ . . 
si(d, a) = A)raT (54) 


rorlti-r)V 
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where 
(A)o = 1, (A); =AAA+1)--- Atr—-1), r=1,2,-+:. 
It was also shown’ that 
(t + 1)si4,(A, a) = as;(A, a) + As;(A + 1, a) 
and 
si(A, a) = s(A + 1, a) — (1 — Bio) 5:-1(A + 1, a). 
From (57) it follows that 


n 


> si(A, a) = SnlA + i; a), 


i=0 


and, from (56) and (58), we deduce that 


) (n = t)si(A, a) = (1 ~~ 8no)Sn—1(A + 2 a). 
i=0 


(55) 


(56) 


(57) 


(58) 


(59) 


We now turn our attention to the Chebyshev polynomials of the 
second kind,” U;(x). They may be defined by the recurrence relation 


2xUi(x) = Unsi(x) + Ui-i(x); U_\(x) = 0, Ud(x) = 1, 
for /= 0,1, ---. From (15) and (60) it follows that 
(az + Ne — p)Vi(p) = aeVi41(p) + noV-1(p), 


Valp)=0, Wolp) = 1. 


From (16) and (61) we deduce that 
[a2(1 — dj2,) + n2 — plo) = aegj-1(0) + no(1 — 8j2,)bj+1(0), 
for 7 = ke. Also, from (61), it may be shown by induction that 
¥ (0) = Y (2) 
r=0 \Q2 
Next, from (11) and (60) it follows that 
(a) + my + p)QXi(p) = aQii(p) + miQr-1(p), 
Q-i(o) =0, = Qo(p) = 1. 
In particular, it is found by induction that 


l r 
(0) = ¥ (2). 


r=0 
Since s;(0, a) = a'/i!, from (54), it follows that 
Sn, (0, )&2;(0) = Sn,-1(0, @)Q2;-1(0) = Sn, (0, a), l= 0, 1, eoe, 
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(60) 


(63) 


(66) 


11 


Next, from (9) and (10), we deduce that 


hy 
p Py ai(p) = (11 + p)ax,(p) — arar,-1(p). (67) 


Then, with the help of (12), (56), (57), and (64), it may be shown that 


hy 


Y @i(o) = Sn,(1 + p, a1)Qq,(0) — Sn,-1(1 + p, @1)Q2q,-1(0), (68) 


i=0 


for p # 0. Moreover, this result holds for p = 0 also, from continuity. 
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The need to assess attendance behavior often arises, at the line- 
management level, when an employee is considered for a transfer or 
a promotion. A sound assessment should, of course, take into account 
the statistical behavior and distributional properties of absenteeism. 
The first part of this paper is a detailed statistical analysis of 
attendance records of a sample of 112 telephone operators. We use 
exploratory and confirmatory statistical techniques to suggest posst- 
ble theoretical models that can parsimoniously describe the behavior 
of the variables of interest. Methodological difficulties that often 
arise in cross-sectional studies and are caused by biased sampling 
are pointed out and treated. We explore the relation between age and 
attendance; in particular it is evident that (for this data set) the 
frequency of “incidental” absences tends to decrease with age, and 
that the duration of “disability” absences tends to increase with age. 
In the second part of the paper we suggest an attendance evaluation 
method based on the statistical analysis of the first part. The method 
is designed to reflect the current-year attendance as well as a longer- 
run attendance behavior, interpreted as a personal characteristic, 
and its properties are demonstrated via examples. 


l. INTRODUCTION AND SUMMARY 


Management policy regarding absenteeism has two major aspects: 
a global one spelled out in the various company rules and applied 
evenly to all employees and a local one, generally less formal, in which 
line management is concerned about individual’s attendance. A ques- 
tion like how many “paid days off” per year an employee should be 
allowed for unexpected and unavoidable absences is often a subject for 
union negotiations and is a good example of what we mean by man- 
agement’s global policy. On the other hand, the need to decide whether 
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a given operator has exhibited satisfactory attendance arises when 
that operator is considered for a transfer or promotion and is a good 
example of management’s local policy. Whether local or global, a sound 
policy should consider the statistical characteristics and the distribu- 
tional properties of absenteeism. 

Section II gives a detailed statistical analysis of absenteeism (on the 
basis of a sample of 112 telephone operators). Such an analysis can 
enhance our understanding of absenteeism, and can be used as a basis 
for answering questions of the type described above. For example, the 
distribution of the duration of incidental absences (Section 2.6) and 
the frequency of incidental absences per year (Fig. 6, or more generally 
Section 2.6), can be used to answer how many paid days off per year 
an employee should be allowed. An answer based on such a statistical 
analysis is more likely to satisfy the true needs of the average employee 
than any decision which makes no reference to the distributional 
properties of absenteeism. (Note that the Bell System’s allowance for 
personal time started after our data were taken.) 

In Section III we suggest a method for assessing absenteeism, based 
on our statistical findings of Section II, and discuss its properties. The 
analysis of Section 2.4 indicates that one year is too short a period to 
decide whether an operator is intrinsically “good,” “bad,” etc., regard- 
ing attendance. Thus, if management is interested in assessing at- 
tendance as a personal characteristic, the follow-up period needs to be 
longer than one year. The conflict between the viewpoint that past 
years’ attendance should not affect the present evaluation (for any 
type of performance rating), and the statistical observation that one 
year is too short a period to assess attendance, are resolved by basing 
our evaluation method (Section III) on two indices. One index rates 
the current year attendance, while the other index rates attendance 
behavior as a personal characteristic, and it depends on the attendance 
during the three most recent years. 

Various aspects of absenteeism have been studied in recent years 
(particularly in the fields of labor relations, applied and industrial 
psychology, and management science). The major contributions of our 
paper to this area of research, and the relation to other studies, as we 
see them, are summarized below: 

(t) We suggest an intuitively appealing method for assessing absen- 
teeism, which reflects the current year attendance, as well as atten- 
dance behavior, as personal characteristics. With suitable modifica- 
tions, the method is adaptable to other occupations. 

(ii) Often in cross-sectional studies a certain sampling bias is intro- 
duced because the sampling is done along the time axis. The detailed 
analysis of Section 2.4 shows how to identify this bias (and in some 
instances how to estimate the underlying model in the presence of this 
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bias). This technique can be of use to other researchers analyzing 
cross-sectional data. (A more detailed paper devoted entirely to statis- 
tical questions that arise in the analysis of this type of data is forth- 
coming.) 

(iit) In the course of our analysis in Section IH, we use some 
graphical techniques that are common tools in exploratory data anal- 
ysis, but are not yet familiar to most social scientists. These tools are 
useful in the tedious chore of identifying patterns and models in large 
data sets, and we hope that exposing them to researchers in the social 
sciences will help make them popular. 

(tv) Throughout the paper we distinguish between two types of 
absences, disability and incidental (definitions in Section II). This 
classification enables us to shed some light on the relation between 
absenteeism and age. Several authors have tried to relate absenteeism 
to age and conflicting findings are often reported. Indeed, in a recent 
study based on a survey of blue-collar production workers, Nicholson 
et al. (Ref. 1, pp. 319-320) report on a marked inverse relation (espe- 
cially for male employees) between absence frequency and age which, 
as they point out, contrasts the conclusions of Porter and Steers? (a 
review of the literature on the subject of absenteeism and turnover) 
and Cooper and Payne,’ that absence frequency increases with age. 
Our data suggest that for telephone operators (all of whom in our 
sample are females) the truth lies somewhere in the middle. That is, 
the frequency of incidental absences is higher for younger operators, 
while the frequency and duration of disability absences is higher for 
older operators. 

For readers who are interested in aspects of absenteeism that are 
not directly related to this work (such as economic, psychological, 
etc.), we include a supplementary reference list (which is by no means 
complete). 


il. DATA ANALYSIS AND STATISTICAL MODELING 
2.1 Introduction 


We distinguish between two types of absences: incidental absences 
(1A), which are usually short, more frequent, and (to a certain extent) 
controllable, and disability absences (DA), which are usually long, less 
frequent, and uncontrollable. Formally, a DA is any absence that lasts 
six or more days and is due to an illness (an exception is an on-the-job 
accident in which case the DA period can be shorter than six days); 
any other absence is defined as an IA. Periods of attendance at work 
will be referred to as showing up (SU) periods. 

Our data are made up of attendance records of 112 New England 
Telephone operators, for variable periods f,, ---, fi12. Out of the 112 
records, 6 cover approximately 1 year (between 0.8 and 1.4 years), 63 
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cover approximately 2 years (between 1.6 and 2.4 years), and 43 cover 
approximately 3 years (between 2.5 and 3.1 years). Here we take a year 
to be 240 working days. The attendance records in our sample do not 
usually start, or end, at a beginning of a DA, IA, or SU period and thus 
two censored (i.e., incomplete) periods typically exist (these are usually 
su periods) for each of the 112 records, one at each end of the record. 
This situation is demonstrated in Fig. 1, which gives a schematic 
example of an attendance record in our data. Note that holidays, 
weekends, vacations, etc., have been deleted from the time axis. The 
large proportion of censored su periods, among the total number of su 
periods, requires special attention and leads to an interesting analysis. 

Frequency of absences, duration of absences, duration of su periods, 
relations between absence and age, etc., are all parts of the complete 
picture of “attendance behavior” of operators. We analyze these vari- 
ables below. In cases where our analysis suggests possible theoretical 
models that can adequately describe the behavior of the variables in 
question, we point out these models. 

Our analysis suggests that operators older than 35 are different from 
operators younger than 35 with regard to certain aspects of absence 
behavior; for the sake of brevity, we refer to the first group as older 
operators and to the second group as younger operators. 


2.2 Duration of IA’s 


A histogram of the duration of the 560 observed 14’s is given in Fig. 
2a. A simple theoretical model that fits these data to a remarkable 
degree of accuracy is 





. ae, a ee Pp if j7=1, 

P{duration of IA =7] = P; hi yee ee eee (1) 
with p = p = 346/560 = 0.6179 (note that if x1, ---, x, is a random 
sample with a probability density function (pdf) (1), then the maxi- 

STATE 

DA — 

IA -- _ —_— — 

Xx, Xo Xn 
SU pee oe mee Cameos — ee nr a oe ee 
TIME 
0 = (WORKING 
BEGINNING DAYS) 
enPlo yes SSS SS SS 
START END 
OF OF 
SAMPLE SAMPLE 


Fig. 1—A schematic description of an attendance record. Note that the first and last 
su periods (X, and X,) are censored (only X; and X;, are recorded in the sample). 
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346/560 


OBSERVED FREQUENCY 





Uy 560 
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Yi ICTTZD. 
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ae 
(a) 


DURATION OF IA (DAYS) 5 6 OR MORE 
OBSERVED FREQUENCY 0.6179 0.1982 0.0911 0.0518 0.0321 0.0089 


P; ACCORDING TO (1) 0.6179 0.1911 0.0955 0.0478 0.0239 0.0239 
DIFFERENCE 0 0.0071 -0.0044 0.0040 0.0082 -0.0150 


DURATION OF IA (DAYS) 4 5 OR MORE 


OBSERVED COUNT 
EXPECTED COUNT, 560 x Pj 
DIFFERENCE 





(b) 


Fig. 2—(a) A histogram of the durations of 14’s (avg = 1.72, stdv = 1.16). 
(b) Comparison between the pdf of (1) and the observed durations of 1A’s. 


mum likelihood and the minimum variance unbiased estimator of p is 

= Wh I[X; = 1]/n, where I[{ | denotes the indicator function; in our 
case this gives p = 346/560). Figure 2b compares the model of (1) with 
the observed data, and the adequacy of the model is transparent. 
Nevertheless, it is interesting to note that a chi-square goodness of fit 
test with size a does not reject the hypothesis that the durations of 
1A’s have the pdf (1), even when a is as high as 0.90! 

A random variable, say X, with the pdf (1) has the following 
interesting property: 


P[X=kh+j|X>k]=2% j=1,2,---, R=1,2,---. (2) 


The interpretation of this property, when X stands for the duration of 
an IA, is the following: On the second day of an incidental absence, the 
employee tosses a coin; if the result is heads the employee returns to 
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work on the next day, otherwise he remains absent. The experiment is 
repeated daily until the first time the result is heads, in which case the 
employee returns to work on the following day. Should one try to 
interpret this interesting property, exhibited by the data, in terms of 
human behavior in regard to short absences? 

We remark that the distribution of the duration of 1A’s for younger 
operators is approximately the same as for older operators and neither 
deviate much from (1). 


2.3 Duration of DA’s 


The following are summary statistics for the 78 DA occasions in our 
data: 


lower quartile = 9.0, median = 13.5 upper quartile = 41.0, 

mean (with six most extreme observations removed) = 26.1, 

standard deviation (with six most extreme observations removed) 
= 24.2. 


Out of the 78 observations, 18 were incurred by operators younger 
than 35. Figure 3 compares, by means of box plots,* the distributions 
of the duration of DA’s in the three different cases; younger operators 
(18 DA occasions), older operators (60 DA occasions), and the combined 


DAYS 





AGE = 35 AGE > 35 COMBINED SAMPLE 
(18 OBSERVATIONS) (60 OBSERVATIONS) (78 OBSERVATIONS) 


Fig. 3—Box plots for the duration of DA’s. 
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sample (78 DA occasions). The lower and upper sides of each box are 
the lower and upper quartiles, respectively, and the segment inside the 
box is the median. If d denotes the distance between the quartiles, 
then the box whiskers are drawn to the nearest data value with 1.5d 
from the nearest quartile. Points lying outside this range are plotted 
individually. The figure suggests that long DA’s are more frequent 
among the older operators. For instance, the upper quartile for the 
older operators is 44.0, with the most extreme observation being 165, 
while the corresponding figures for the younger operators are 32.0 and 
44.0. This difference cannot be accounted for by differences in the total 
sampling durations, because the distribution of the ¢;’s is approximately 
the same for the two age groups. 


2.4 Duration of SU periods 


We use Fig. 1 as a vehicle to explain some basic concepts regarding 
the censoring of su periods. Suppose the length of each individual su 
period (X; in Fig. 1) is distributed according to the cumulative distri- 
bution function (cdf) F(u). Then, since the probability of any individ- 
ual period that covers the point f is directly proportional to its length 
u, the distribution function of the length of the interval that covers fo 
(Xi, in Fig. 1) is H(x) = fo udF(u)/m (m is a normalizing constant that 
is equal to the mean of F’). Given X;, however, the distribution function 
of X1, which is the observable part of Xi, is uniform on the interval [0, 
X;] so that (using Bayes’ theorem) the unconditional distribution of 
Xj is 


G(y) = P[Xis y]=m™ | [1 — F(u)] du. (3) 
0 


Since (3) is usually derived in the context of renewal processes, in 
which case an assumption about the independence of different X;’s (SU 
periods in our application) is built in, it is important to note that this 
assumption is not used in the derivation of (3) (cf. Ref. 5, p. 66), and 
therefore it is not assumed in our discussion. In the analysis that 
follows, however, we assume (unless otherwise stated) that su periods 
of different operators have the same cdf F’, as long as they are in the 
same age group. 

It is clear that the argument leading to (3) applies also to X;,, so that 
(3) is the distribution of the censored su periods. An important 
property of (3) is 


G=F if, and only if, F is an exponential distribution 
fie, F(x) =1l-—e*", x>0, p>O]. (4) 


Or, in words, 
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the censored su’s and the completed su’s have the same 
distribution if, and only if, the distribution 
of the su’s is exponential. (4’) 


2.4.1 Analysis for operators younger than 35 


For the younger operators, Fig. 4a compares censored sv’s with 
completed su’s, by means of a Q-Q plot (see Ref. 6, chapter 6). The 
deviation of the plot from a 45-degree line through the origin is not 
very large and for practical purposes one can assume that the censored 
su’s and the completed su’s follow the same distribution. Being more 
formal, if we test the hypothesis that the two samples have the same 
distribution, using a Wald-Wolfowitz runs test, we observe 92 runs 
while the mean and standard deviation under the null hypothesis are 
99.0 and 6.0, respectively, so that the hypothesis is not rejected at 
significance levels of 0.12 or less. Thus, from (4’), we are led to the 
conclusion that the distribution of the su’s is exponential (or, at least, 
that this is an adequate description of the data). Figure 4b is a 
comparison of the combined su sample (censored and completed) 
versus quantiles from exponential distribution. The striking closeness 
to linearity of this plot strongly supports the conclusion that the su’s 
are exponentially distributed. The estimated mean of the combined 
sample is 60.2 days, and the standard deviation is 61.2 (which is very 
close to the mean, as is to be expected from a sample from exponential 
distribution). In summary, for the purpose of fitting a parsimonious 
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CENSORED SU's IN DAYS (65 OBSERVATIONS) 
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COMPLETED SU’s IN DAYS (199 OBSERVATIONS) 


Fig. 4a—Q-Q plot of censored su’s (Y axis, 65 observations) versus completed su’s 
(X axis, 199 observations), for operators younger than 35. 
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Fig. 4b-—Q-@ plot of completed and censored su periods (Y axis, 264 observations) 
re eee — u) (quartiles of standard exponential distribution), for operators younger 
an 35. 


model, one can assume that the periods between consecutive absences, 
for operators of age 35 or less, follow an exponential distribution with 
mean = 60 days. 


2.4.2 Analysis for operators older than 35 


Applying a similar analysis as in the previous case, we point out an 
interesting “data paradox” exhibited by the two su samples (censored 
and completed), and we give possible explanations for this paradoxical 
behavior of the data. 

Figure 5a compares the censored su’s with the completed su’s. The 
deviation of the Q-Q plot from the 45-degree line through the origin 
is marked, and it is evident, therefore, that the completed and the 
censored su’s have different distributions. Specifically, the censored 
su’s appear to be stochastically bigger than the completed su’s (the 
Q-@ plot is on or above the 45-degree line through the origin) and for 
comparison we look also at their summary statistics: 


(lower quartile, median, upper quartile, mean, stdv) = 
(20.0, 46.0, 88.0, 66.3, 71.2) for the completed su’s, and 
(21.0, 47.0, 206.0, 113.3, 130.4) for the censored SU’s. 


In view of this situation and the assumption that su’s of different 
operators have the same cdf, (4’) suggests that the distribution of the 
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Fig. 5a—Q-@Q plot of censored su’s (Y axis, 142 observations) versus completed su’s 
(X axis, 333 observations), for operators older than 35. 


completed su’s cannot be exponential. Figure 5b, however, in which 
we compare the completed su’s with exponential quantiles, points in 
the opposite direction. The closeness to linearity of the Q-Q plot (with 
the exception of the upper 17 points) suggests that the completed su’s 
do follow an exponential distribution (or perhaps an exponential with 
a 5-percent contamination). 

We give two possible explanations to this data paradox. The first is 
that intrinsic differences in absence behavior might exist among the 79 
operators of age greater than 35, so that any attempt to fit a single cdf 
to the su periods of these operators is meaningless [in mathematical 
language this means that, in eq. (3), different operators are associated 
with different distribution functions F, while we try to fit a single FJ, 
and a more complicated model is needed. One possible model is that 
operators can be naturally classified into classes according to their 
attendance behavior (good, bad, etc.). Nevertheless, the sampling 
periods (¢,’s) in our data are not long enough to enable us to decide 
whether a given operator is intrinsically good, bad, etc., and thus we 
have not pursued this model. 

In the second possible explanation, we show that a certain sampling- 
bias effect could have been the source of our data paradox. Suppose 
the cdf of su periods, which is F of eq. (3), is a mixture of two cdf’s, an 
exponential cdf with mean A, which is small relative to the sampling 
periods ¢;, and a degenerate cdf which assigns a unit mass to a point B 
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Fig. 5b>—Q-Q plot of completed su’s (Y axis, 333 observations) versus —log(1 — uw) 
(quartiles of standard exponential distribution), for operators older than 35. 


which is big relative to the ¢;’s. This means that over a long period of 
time, a certain proportion, say a, of the su’s have an exponential 
distribution, while the other su’s last a fixed length B. Now any 
sampling period of length ¢ satisfying A « ¢ < B cannot possibly 
contain a completed su period of length B, so that all the completed 
su’s must be from the exponential population and only censored sv’s 
could possibly be from the B population. In addition to being a model 
that accommodates the “paradoxical” behavior of our data, this model 
provides a useful framework for estimation. Under the model’s as- 
‘sumptions 


P[su > x] =1— F(x) = ae~*“ + (1 — a) I [x < B], (5) 
where I[ ] denotes the indicator function, so that, using (5) and (8), 
the moments of the censored su’s are 
E[censored su |” = | X” dG(x) 
0 
_ alin + 1)!A"*? + (1 —a)B" 
(aA+(l1—a)B)(n+1) ’ 


Since our model assumes A < ¢ we have, to a good approximation, 


n=0,1,--+. (6) 


t 
E[completed su]" = | x"Ate*/4dx~niA", n=0,1,-+-, (7) 


0 
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and therefore the moments method, applied to (6) and (7), gives 


E[completed su] = 66.3 =A, 
q2 _ 1) p2 
E[censored su] = 113.3 = CAS OP: (8) 


~~ (aA + (1 —a)B)’ 
6aA? + (1 — a)B? 
3(aA + (1 —a)B)’ 


which yield the estimates A = 66.3, B = 500.0, & = 0.96. 

Though the above model accommodates the type of behavior dem- 
onstrated by our data, so do other models based on a contaminated 
exponential distribution and the question of finding a model that fits 
our data well has not been answered yet. Toward this end we derived 
a nonparametric estimate of F, denoted F, by tailoring the Kaplan- 
Meier estimator’ to our application, in which each completed obser- 
vation has to be counted with multiplicity two. (The exact details of 


E[censored su]* = 29841.05 = 


PROBABILITY 
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Fig. 5c—The dotted line is F(x) = Poiaer[SU <= x] (using a modification of the Kaplan- 


Meir estimator, Section 2.4). The solid line is the contaminated exponential distribution 
of eq. (1). 
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this estimator will be discussed in a separate paper.) The result is 
given in Fig. 5c. The contaminated exponential model 
6 


F(x) = 0.94(1 — e7*/) + 006(=5) I[0<x<540], x=0, (9) 


is superimposed on this figure, and it seems to fit the data rather well. 
Figure 5d compares (9) with F(x) by means of a Q-Q plot, and the 
closeness of the plot to the 45-degree line reassures us about the 
adequacy of the model. 

The behavioral interpretation of (9) is that usually (i.e., 94 percent 
of the time) the duration of su periods follows an exponential distri- 
bution with mean 70 days, while occasionally (i.e., 6 percent of the 
time) an su period can be much longer (perhaps 500 to 540 days). We 
note that the average SU period, according to (9), is approximately 94 
days, which is substantially bigger than the corresponding number for 
the younger operators (60 days). 


DAYS 





0 100 200 300 400 500 600 
DAYS 


Fig. 5d—A Q-Q plot of F(x) = Pruser[su S x] (using a modification of the Kaplan- 
Meir estimator, Section 2.4) versus the contaminated exponential distribution of 
eq. (10). 
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An important aspect of our data-paradox, and the contaminated 
exponential model, is that it indicates that a follow-up period of one or 
two years is not sufficiently long for evaluating the attendance behavior 
of operators. This observation has implications to our discussion of 
evaluation procedures. 


2.5 Frequency of absences and total time lost (TTL) due to absences 


Figure 6 gives box plots of the frequency of 1A’s (occasions per year) 
for the two age groups and for the combined sample. (The nonoverlap- 
ping of the notches in the first and second boxes indicates a difference 
at the rough 5-percent significance level between the two medians.*) 
One can immediately see that younger operators tend to have substan- 
tially more 1a’s. Note again that this difference cannot be accounted 
for by differences in the total sampling durations, because the distri- 
bution of the ¢;’s is approximately the same for the two age groups. 

The situation regarding DA’s is somewhat reversed, as one can see 
from Tables I and II. For example, while the proportion of the younger 
operators in the sample is 29 percent, the proportion of the DA 
occasions incurred by them is only 23 percent. We also see in Table II 
that the ratio “TTL due to DA” to “TTL due to IA” is 0.022/0.021 for 
younger operators, while it is 0.045/0.014 for older operators. This, 
plus the fact that the probability distribution of the pa duration for 
older operators has a substantially longer tail than the corresponding 
quantity for younger operators (Fig. 3), explains the fact that the TTL 
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Fig. 6—Box plots for the frequency of 1a (occasions per year). 
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Table I—DA occasions and age 


No. of No. of 
Individuals No. of Operators 
with Occasions Occasions in the Entire 
of DA of DA Sample 
Age = 35 13 (25%) 18 (23%) 33 (29%) 
Age > 35 40 (75%) 60 (77%) 79 (71%) 
53 (100%) 78 (100%) 112 (100%) 


caused by absences is somewhat higher for older operators (5.9 percent) 
than for younger operators (4.2 percent). 

The first line of Table II shows that a period that starts at the end 
of the 1A and ends at the end of the following IA (including the possible 
DA’s) lasts, on the average, 72 days for younger operators, and 107 days 
for older operators. 


2.6 The number of IA occasions (IAO) over a fixed period of time 


For later applications we want to derive an estimate for the proba- 
bility distribution of the number of 1A0 over a fixed period of time, for 
an arbitrary operator in our sample. To keep the analysis and the 
presentation simple we ignore, for the time being, the differences 
between younger and older operators. Later we will comment on the 
corresponding analysis when the difference in attendance between the 
two age groups is taken into account. It is well known (e.g., Ref. 5, 
p. 104) that under fairly weak assumptions about the statistical behav- 
ior of the periods between consecutive IA’s, the quantity Vi[N (t)b/t] 
has a limiting distribution as t > o, Here N(¢) denotes the number of 
IAO over a time period of length ¢ and b is the average number of 1A0 
per unit time. We take this theoretical model as a framework for 


Table II—Age comparison of certain absence characteristics (DA = 
disability absence, IA = incidental absence, TTL = total time lost) 


Combined 
Age = 35 Age > 35 Sample 
ta on Baas —_ 0.078 = = 0.148 oa = 0.122 
Solal sacapling periods Ta OM gee 0014 aan = 0.016 
Talal Siampline periods aan = 0.022 ang = 0.045 nia = 0.038 
coral sannpling porfods Tea M2 ere 0059 | = 0.054 


* Total sampling periods = the sum of all the observation periods across operators 
(i.e., sum of the t’s of Fig. 1). 
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NUMBER OF IA OCCASIONS, N(t;) 





0.5 1.0 1.5 2.0 2.5 3.0 3.5 
SAMPLING DURATION (t)) IN YEARS 


Fig. 7—A scatter plot of the number of 14 occasions, N(¢;) (Y axis) versus the 
sampling duration, ¢; years (X axis), for the 112 operators. 


producing estimates of the distribution of N(t) for a given ¢t. A scatter 
plot of (¢;, N(t;)), i = 1, --+, 112, is given in Fig. 7, and one can see 
immediately that VAR(N(t)) increases with ¢ (this is usually referred 
to as heteroscedasticity), as to be expected from our model. Note that 


10 


—2 


| 
je 


t; < 2.5 YEARS t; 2 2.5 YEARS 


4 











Fig. 8—Comparison of u; = Vti.N (t;)/t: — 6] for small and large values of t;. 
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our model implies that the mean and the variance of N(t) are approx- 
imately linear in ¢, for large values of ¢. The regression estimate of 6 in 
the model 


N(ti)/Vti = b Viti + U; (10) 


is 6 = 2.24 (with a ¢ value of 15.36). Figure 8 compares (by means of 
box plots) the residuals, U;’s, of the regression (10) for periods of 
length ¢; < 2.5 years with the U;’s for periods of length t; = 2.5 years. 
The choice of ¢ = 2.5 as a cutoff point seems natural from the 
distribution of ¢t’s (see second paragraph of Section 2.1); other ¢’s in 
the neighborhood of 2.5 gives similar results. The comparison shows 
that the heteroscedasticity of the data (Fig. 7) is eliminated and the 
U;’s can be considered as random variables satisfying VAR U; = 
a2 > 0, independent of t, so that the empirical distribution of the U;’s 
can be used to estimate the distribution of N(t). 

Theoretically, if (¢) our assumptions (e.g., all operators behave 
according to the same probability law) were completely realistic and 
(it) ¢ were very large, then the distribution of U would be close to a 
normal distribution and we could use this fact to estimate the distri- 
bution of N(t). Since, however, neither (7) or (ii) is entirely correct, 
we do not rely on the asymptotic normality of U. Instead we use the 
empirical distribution of the U;’s as an estimate of the distribution of 
U, and hence obtain an estimate for the distribution of b¢ + VtU. In 
practice, however, since N(¢) is restricted to the nonnegative integers, 


Table III—P[N(t) = j], 
estimated probability that 
the number of IA 
occasions, over a period 
of length t, equals / 


» t lyear 2 years 3 years 


0.29 0.09 0.01 
0.13 0.16 0.08 
0.14 0.11 0.08 
0.12 0.07 0.09 
0.17 0.11 0.11 
; 0.07 0.07 

0.03 0.10 0.07 

0.02 0.12 0.10 

0.02 0.07 0.05 

0.0 0.04 0.06 
10 0.0 0.02 0.13 
11 0.0 0.02 0.05 
12 0.0 0.02 0.04 
13 0.0 0.0 0.02 
14 0.0 0.0 0.02 
15 0.0 0.0 0.02 


COOBNMHILwWNe © 
oO 
> 
oo 


Total 1.00 1.00 1.00 
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we look at 
m(t) = max{0, [6t + VtU; + 4]}, (11) 


where [x] denotes the integer part of x, and we estimate the distribu- 
tion of N(t) by 


P,(j) = P[N(t) =7] = (number of m;(t) = 7) /112. (12) 


Table III gives P,(/) for ¢ = 1, 2, 3, years. 

Comment: In view of the difference between younger and older 
operators, it would have been more appropriate to estimate P[N(t) 
= J] separately for younger and for older operators, and then to use 
their relative weights in the entire sample to obtain a final estimate. 
That is, 


5 é 79 . 
Pyoungerl N(¢) = J] + => Potder[ N(t) = 7]. 


2 : 33 
PIN() =J) = 755 2 


2 


The actual values of the estimates using this method are close to the 
values of the estimates we obtained (Table III) without partitioning 
the sample, and therefore we do not give the details of this calculation. 
Note that the estimates of (12) are motivated by a model that imposes 


Table IV—P[L(t) <j], 
estimated probability that 
the TTL from IA’s, over a 

period of length t, is at 

most j/ days 


x l year 2 years 3 years 


0.29 0.09 0.01 
0.37 0.19 0.06 
0.45 0.26 0.11 
0.52 0.32 0.16 
0.60 0.38 0.21 
0.68 0.43 0.26 
0.75 0.48 0.31 
0.81 0.53 0.36 
0.86 0.58 0.41 
0.90 0.63 0.46 
10 0.93 0.68 0.50 
11 0.95 0.73 0.54 
12 0.97 0.77 0.58 
13 0.98 0.81 0.62 
14 0.99 0.84 0.66 
15 1.00 0.87 0.70 


WONMHMTLWNeH OO 


16 0.89 0.74 
17 0.91 0.78 
18 0.93 0.81 
19 0.95 0.84 
20 0.96 0.87 
21 0.97 0.90 
11 0.98 0.92 
12 0.99 0.94 
24 1.00 0.96 
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very few assumptions on the data. Other possible frameworks for 
estimation, which impose more conditions on the data (e.g., Poisson 
arrivals of the IA’s) result in estimates for which we feel that the 
assumptions, rather than the data, determine the actual values of the 
estimates. However, with more absenteeism data (in particular, longer 
t;’s) it is possible to identify a useful parametric model for estimating 
P(N(t) =J]. 
Let L(t) denote the TTL from the N(¢) occasions of 1A. Clearly 


L(t) =X, + Xot--- + Xnit), (13) 


where X; denotes the duration of the ith 14. Assuming that the X;’s are 
independent of N(t), we have 


P{N(t) =n, L(t) -n=P[3x=7] PMO =n). (14) 
1 


Combining the estimates (12) and (14) with (15), we obtain the joint 
probabilities of N(t) and L(t) for ¢ = 1, 2, 3 years. The marginal 
distributions of N(t) and L(t) (Tables III and IV, respectively) are 
then used to construct Tables Va, b, and c, as described below. 


2.7 Constructing Tables Va, b, andc 


Tables Va, b, and c are the building blocks of our proposed evalua- 
tion procedure (Section III) and understanding their construction 
enables the user to interpret the ratings R., Ry, and R, which make up 
the attendance evaluation scheme. 

To each possible value of N(1), the number of IA’s in a single year, 
and to each possible value of L(1), the total number of days lost in 
these IA’s, we attach a grade and a score. Values of N(1) which lie in 
the lower 5 percent of the distribution of N(1), which is given in Table 


Table Va—Scoring table on the basis of one-year attendance 


Number of Occasions 


‘ 0 1 2 3 4 5 6 
0 100 E 100 
1 74 G 74 
2 68 «55 G 62 
ale 60 49 43 F 49 
Bl 4 56 460 46——i(itid—t~*«é«*D F 43 
S| 5 5243 37 30 ~~ 3 F 37 
S| 6 48 39 34 27 21 0 F 31 
5] 7 42 34 30 24 18 0 pP 24 
2/8 383 97 #2927 0 P 20 
gl 9 34:8 24 20 «15 0 Pp 16 
Z| 10 30-24 21 17 «18 0 pP 12 
11 24 20 17 4 ol 0 P 8 
12 0 0 0 0 0 0 U 0 

E G F F P POU 

100 74 #42449 37 ~ 24 14 0 
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Table Vb—Scoring table on the basis of two-years attendance 


Number of Occasions 
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Number of days 
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III, are given the grade Excellent and their scores vary between 100 
and 95; values of N(1) which lie between the 6th and the 25th percentile 
of the distribution of N(1) are given the grade Very Good and their 
scores vary between 94 and 75, etc. [The particular score depends on 
how many values of N(1) fall in this range. For example, if only one 
value of N(1) lies between the 6th and 25th percentile, its score is 94 
(e.g., the rightmost column of Table Vb); if there are two values, their 
scores are 94 and 84 = 94 — ('4)(94 — 75) (e.g., the lower-most row of 
Table Vc); if there are three values, they get the scores 94, 88 = 94 — 
(44)(94 — 75) and 81 = 94 — (%)(94 — 75), and so on.] We treat L(1) 
similarly, using the estimated distribution in Table VI. Table VI gives 
the details of the grading and scoring method, and it is used for N(¢) 
and L(t), ¢ = 1, 2, 3. 

An exception to Table VI is made when N(¢) = 0 [and hence L(t) 
= 0], in which case the grade is Excellent and the score is 100 regardless 
of whether 0 is in the lower 5 percent of the distribution of N(t) [note 
that P(N(1) = 0) = 0.29 and P(N(2) = 0.09); see Table III]. The scores 
(and grades) associated with each value of N(1) and L(1) are written 
on the margins of Table Va and each entry in the body of the table is 
the geometric mean of the marginal scores; for example, R,(N(1) = 3, 
L(1) = 4) = V387 x 48 = 40. The reason for choosing the geometric 
mean to combine the marginal scores is to achieve the desirable shape 
of the equicontours of the resulting table. More specifically, we observe 
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the following attractive properties: (1) The ratings decrease along the 
west-east and north-south directions. (ii) Each entry in the table is 
slightly greater than (or equal to) its north-east neighboring entry. 
(This implies that a reduction in the number of occasions of 1A’s is 
desirable even at the expense of a slight increase in the TTL.) (iiz) An 
operator is rated Unsatisfactory whenever at least one margin is rated 
as such. 

Tables Vb and Vc are constructed similarly with the obvious sub- 
stitutions of (N(2), L(2)) and (N(8), L(8)) for (N(1), L(1)). 


Il. EVALUATING ATTENDANCE AT WORK 
3.1 Introduction 


The attendance behavior of an operator is one of the most important 
components in the operator’s overall performance, so it is evaluated 
regularly. In particular, it is weighed very carefully when the operator 
is considered for a transfer or promotion. So far, however, attendance 
has been assessed in local terms (compared to other operators in the 
office) and naturally this is done in a subjective and informal way. 
Though the ‘informality is an advantage both for management and 
employees, this is not so for the subjectivity of the evaluation. A 


Table Vc—Scoring table on the basis of three-years attendance 


Number of Occasions 


n 0 1 2 3 4 5 6 7 8 9 10 Il 12 18 
100 100 
94 94 
91 86 89 
89 84 79 84 


86 81 76 72 

83 79 74 70 66 

81 77 72 68 64 60 

79 74 70 66 62 57 53 

76 72 68 64 60 55 52 48 

74 70 66 62 58 53 50 46 42 

71 67 #63 60 56 51 48 45 41 36 

68 64 60 57 53 49 46 43 39 34 30 

65 61 58 54 51 47 44 41 37 33 28 23 
62 59 55 52 49 45 42 39 36 31 27 22 
59 56 52 49 46 43 40 37 34 30 26 21 
56 53 49 47 #44 40 38 35 32 28 24 20 
52 49 46 44 41 38 35 33 30 26 23 19 
47 45 42 40 37 34 32 30 27 24 21 17 
44 42 39 37 32 35 30 28 26 22 19 16 
41 39 36 34 32 30 28 26 24 21 18 #15 
38 35 33 31 29 27 25 24 #22 19 16 18 





Number of days 
l Oe a ee ee ee) 
SOWMOMARDAAWNK CUMBIA AWNHO 
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21 34 32 30 28 26 24 23 21 19 #17 «+15 «12 12 
22 29 27 26 24 23 21 20 #18 #%I7 #+%15 «+13 ~=«10 9 
23 24 22 21 20 19 17 #16 15 +14 #+12 «10~= «8 6 
24 0 0 0 0 0 0 0 0 0 0 0 0 0 
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100 94 84 74 66 58 49 43 37 31 24 #18 12 
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Table VI—Grades and scores for N(t) and L(t) 


Percentile 

range Grade Score Range 
0-5 Excellent 100-95 
6-25 Very-good 94-75 

26-50 Good 74-50 

51-75 Fair 49-25 

76-95 Poor 24-5 

96-100 Unsatisfactory 0 


scheme that allows an objective and consistent evaluation of atten- 
dance would be of potential use to line management. 

In Section II we studied in detail the statistical aspects of absentee- 
ism. Our analysis, in particular Section 2.4, suggested that if one is 
interested in attendance behavior as a personal characteristic, then 
one year is too short for evaluating it. The far past, on the other hand, 
bears little relevance to recent attendance behavior and thus should 
not be included in the attendance evaluation. In this section we suggest 
an evaluation method based on the present and near past (three most 
recent years) that reflects the current year attendance as well as 
attendance behavior in a more general sense. We recall, however, from 
the analysis of Section II that disability absences (DA’s) are intrinsi- 
cally different from incidental absences (1A’s). The high variation in 
the distribution of the duration of DA’s and their low frequency of 
occurrences make it hard to give meaningful statistical guidelines as to 
what can be considered good, bad, etc., behavior regarding DA’s. 
Furthermore, management can do practically nothing to control DA’s. 
We therefore base our attendance evaluation on IA’s only. 

In a sensitive issue such as absenteeism from work, the numerical 
values of the attendance rating do not always tell the whole story. Any 
method for evaluation might occasionally misjudge good employees, if 
it is used in a formal and rigid manner. Thus, the best way to avoid 
these effects is to use it as an informal tool. One has to keep in mind 
that for every absence there is a reason, and these reasons are not 
reflected in the formal attendance ratings. 


3.2 The evaluation procedure 


The proposed scheme is best explained with an example. Consider 
an operator who started to work on January 1970 and whose IA 
occurrences and total time lost (TTL) are given in Table VII. 


Table Vil—Record of IA’s 
Year 1970 1971 1972 1973 1974 1975 1976 1977 1978 


No. of IA occasions 1 4 2 0 0 4 2 0 1 
TTL due to IA’s 1 6 3 0 0 5 4 0 1 
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Table Vill—Example of the evaluation scheme (Lines 4, 5, 6 are read off Tables Va, b, c) 


.(N(1), LQ) 
- (N (2), L(2)) 


- (N(3), L(3)) 


Ra 
Re 
R. 
R 


1970 
(1, 1) 
74 


74 


1971 

(4, 6) 

(5, 7) 
27 
45 


36 


1972 
(2, 3) 
(6, 9) 
(7, 10) 
49 
36 
48 
44 


1973 
(0, 0) 
(2, 3) 
(6, 9) 
100 
71 
53 
75 





1974 1975 1976 
(0, 0) (4, 5) (2, 4) 
(0, 0) (4, 5) (6, 9) 
(2, 3) (4, 5) (6, 9) 
100 30 46 
100 “54 36 
84 70 53 
95 51 51 


1977 
(0, 0) 
(2, 4) 
(6, 9) 
100 
69 
53 
74 


1978 
(1, 1) 
(1, 1) 
(3, 5) 
74 
94 
74 
81 


To evaluate the operator’s attendance we propose the following _ 
simple procedure: 

(1) Determine the first three lines of Table VIII as follows: For each 
year use Table VII to calculate N(t) and L(t) (¢ = 1, 2, 3), the number 
of IA occasions, and the TTL due to IA’s during the ¢ most recent years, 
respectively. 

(iu) For each year, read the ratings associated with (N(1), L(1)), 
(N(2), L(2)), and (N(3), L(3)) from Tables Va, b, and c. These ratings 
are written in lines 4, 5, and 6 of Table VIII, respectively, and their 
interpretation [in terms of percentiles of the marginal distributions of 
N(t) and L(t), t = 1, 2, 3] is described in Section 2.7. 

(tit) Determine line 7 of Table VIII, the attendance index of the jth 
year, according to the following formula: 


R; = avg(Ra,;, Ri, R..;), if Ra,j-1 = Ra; 
: max{R;-1, ave (Raj; Ro, R.;)}, if Raj-1 < Rug ° 


For example, in calculating Ri975 we first compare Raj975 with Ra,1974. 
Since Ra,1974 = 100 = 30 = Ra,1975, we take Rio75 = avg (Ra,1975, Ro,1975, 
Reji975) = (80 + 54 + 70)/3 = 51. On the other hand, in calculating 
Rioze, comparing Ra,i975 with Ra,i976 shows that Ra,i975 = 30 < 46 = 
Ra,i976, SO that Rio76 = max {51, (46 + 36 + 53)/3} = max {51, 45} = 51. 

(iv) The formal evaluation consists of two indices, R, (line 4) which 
is the current year rating and FR (line 7) which can be considered as an 
index for attendance behavior (here we view attendance behavior as a 
personal characteristic of the operator), or in short attendance index. 


3.3 Properties of the proposed procedure 


(t) While the current year rating, Ra, reflects the attendance in the 
most recent year, the attendance index, R, takes the near past into 
account, enabling the operator to build up credit. For instance, while 
1975 itself was a Fair year (R. = 30), in the example of Table VIII the 
attendance index, R, for 1975 was Good (R = 51). This is due to the 
perfect attendance during the previous two years. And indeed, if in 
1975 this operator was considered for promotion, then the score 51 is 
a better indicator of her attendance behavior (considered as a personal 
characteristic) than her current year rating of 30. Similarly, the effect 
of bad attendance cannot be entirely erased in a single year of perfect 
attendance, as can easily be seen in the years 1972 and 1973. 

By the nature of its definition, R is much smoother than R, and is 
a better indicator of attendance. To reemphasize this point, consider 
an operator whose attendance record fluctuates from (N(1) = 0, L(1) 
= 0) to (N(1) = 6, L(1) = 12) to (N(1) = 0, L(1) = 0), ---, etc. The 
current-year rating then fluctuates from R, = 100 (Excellent) to R, = 
0 (Unsatisfactory) while the attendance index fluctuates from R = 58 
(Good) to R = 9 (Poor), which seems more appropriate overall. 
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(ti) If Raj-1< Raj, then R;-1 = R;, or, in other words, if the current 
year rating has improved, then the attendance index will not decrease. 
This follows from the definition of R and is done to avoid negative 
reinforcement. The situation is exemplified in moving from 1975 to 
1976 in Table VIII. There we have Rais = 46 > 30 = Rajig7s (an 
improvement in the current-year rating), so we take Ris7s = 51 = Rigzs 
despite the fact that avg(Ra,1976, Ro,1976, Re,976) = 45 < 51. 

Since the procedure allows the operator to build up credit, the 
reverse situation does not hold and one can have R,j-1 > Ra,; (dete- 
rioration in the current-year rating) with R;-1 < R; (improvement in 
the attendance index). This is exemplified in the ratings of 1977 and 
1978 in Table VIII. And indeed, even though the attendance in 1978 
was worse than the attendance in 1977, the period 1976-1978 as a 
whole reflects better attendance than the period 1975-1977. 


iV. REMARKS 


(tz) Though the technical details (such as the length of the periods 
to be used for rating, and the specific values in Tables Va, b, and c) are 
tuned to telephone operators (more specifically to our sample), the 
method itself can be adapted to other occupations. In occupations with 
substantially higher absence rate, such as auto workers [see, for 
example, the data collected from 60 blue-collar employees of an auto- 
mobile-parts foundry, reported in Morgan and Herman (Ref. 8, pp. 
739)], periods of 1, 2, and 3 years are too far in the past to affect the 
current attendance index and should be replaced with shorter periods 
(e.g., 6, 12, and 18 months). 

(it) As pointed out by a Bell Laboratories referee, the choice of the 
scoring bands in Table VI is somewhat arbitrary, and these bands 
differ from the HOLU (high, objective, low, unsatisfactory) bands that 
were recommended by the AT&T Measurements Task Force. Since, 
however, our main contribution here is the general approach for 
evaluating attendance (i.e., weighing the recent past in the attendance 
index) rather than the particular details, we prefer to leave the expo- 
sition as is. 

(zit) In view of the difference between younger and older operators 
in regard to 1A’s, note that the proposed scheme is tuned to a popula- 
tion of approximately 30-percent younger operators and 70-percent 
older operators (as in our sample). This proportion emphasizes the 
better behavior of the older operators without setting unattainable 
standards for the younger operators. 
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Resource Sharing for Efficiency in 
Traffic Systems 


By D. R. SMITH and W. WHITT 
(Manuscript received June 27, 1980) 


Experience has shown that efficiency usually increases when sep- 
arate traffic systems are combined into a single system. For example, 
if Group A contains 10 trunks and Group B 8 trunks, there should be 
fewer blocked calls if A and B are combined into a single group of 18 
trunks. It is intuitively clear that the separate systems are less efficient 
because a call can be blocked in one when trunks are idle in the 
other. Teletraffic engineers and queuing theorists widely accept such 
efficiency principles and often assume that their mathematical proofs 
are either trivial or already in the literature. This is not the case for 
two fundamental problems that concern combining blocking systems 
(as in the example above) and combining delay systems. For the 
simplest models, each problem reduces to the proof of an inequality 
involving the corresponding classical Erlang function. Here the two 
inequalities are proved in two different ways by exploiting general 
stochastic comparison concepts: first, by monotone likelihood-ratio 
methods and, second, by sample-path or “coupling” methods. These 
methods not only yield the desired inequalities and stronger compar- 
isons for the simplest models, but also apply to general arrival 
processes and general service-time distributions. However, tt is as- 
sumed that the service-time distributions are the same in the systems 
being combined. This common-distribution condition is crucial since 
it may be disadvantageous to combine systems with different service- 
time distributions. For instance, the adverse effect of infrequent long 
calls in one system on frequent short calls in the other system can 
outweigh the benefits of making the two groups of servers mutually 
accessible. 


|. INTRODUCTION AND SUMMARY 


From extensive experience in teletraffic engineering, it is well known 
that congestion can often be reduced by sharing resources. The block- 
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ing probability in a loss system and the average waiting time in a delay 
system are usually much less when separate facilities serving separate 
streams of traffic are combined to serve all the streams together. 
Alternatively, for a given level of congestion, fewer facilities are usually 
required to serve the streams together. Sometimes such results are 
trivial: Whenever the combined system may be managed as if it were 
in fact separate systems, the optimal performance of the combined 
system is at least as good as that of the separate systems. However, 
such management is not allowed in the models treated here. In any 
case, the efficiency of shared resources is certainly a fundamental 
principle of teletraffic engineering. 

The purpose of this paper is to establish versions of this efficiency 
principle mathematically. Our first two results verify conjectures by 
Arthurs and Stuck.' To state our first result, let L(s, A 4) denote the 
stationary loss or overflow rate in an M/M/s loss system (no waiting 
room) with s servers, arrival rate A, and individual service rate p. (See 
Kleinrock” for background on the queuing models.) It is well known 
that L(s, A, w) = AB(s, a), where a = A/p and B(s, a) is the familiar 
Erlang blocking formula: 


" B(s,a) = (ar/st)/ 5, (a“/k!); (1) 


see Jagerman’ and references there. The first efficiency principle we 
establish says that L (s, A, u) is a subadditive function of (s, A) for each 
fixed p: 

Theorem 1: For all positive integers s, and s2 and all positive real 
numbers \y, Az and p, 


L (si + s2, Ay + Av, w) S L(si, Aa, w) + L(S2, rz, yw). (2) 
This yields immediately that 


(3 Six) a) = ¥ G(s, ai) 
inl isl i=] 


for each integer n, where G(s, a) = aB(s, a), which is the version of 
Theorem 1 actually conjectured by Arthurs and Stuck.' 

Of course, Theorem 1 should not surprise teletraffic engineers, since 
it can be inferred from common tables and graphs, but it has apparently 
not been proved before. It appears that all previous mathematical 
results can be described as “one-parameter” results. The relation (2) 
has been deduced for special cases in which quantities such as the 

- blocking probability or the load per server are held constant. For 
example, it is known that if one combines separate groups of 7 and k 
trunks, each operating at a blocking probability 0.01, then the new 
blocking probability will be less than 0.01 (or, alternatively, the com- 
bined system can handle an increased total load and retain the same 
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0.01 blocking probability); see p. 68 of Cooper.* Such results are often 
presented without rigorous mathematical support. 

From Paul Burke we learned about another special case that has 
been known for a long time. It is not difficult to show that B (ts, ta) is 
strictly decreasing in t (see the appendix), from which (2) easily follows 
in the case A;/s; = Ao/s2. Herbert Shulman has also shown that 
Theorem 1 follows easily from the monotonicity of B (ts, ta) in ¢ and 
the convexity of B(s, a) in s for s = 1, but such convexity has not yet 
been established (see the appendix). For further discussion of other 
related work, see Section 5.1 of Kleinrock.” 

To state our second result, let D(s, A, 1) denote the mean steady- 
state delay in an M/M/s queue with infinite waiting room, FCFs (first- 
come, first-served) queue discipline, s servers, arrival rate A, and 
individual service rate p. It is well known that D(s, A, uw) = C(s, A/p)/ 
(su — A), where C(s, a) is the Erlang delay function: 

S/(o — ])t (s — 
beams: a’/(s — 1)! (s — a) (3) 
YX (a*/k!) + a®/(s — 1)! (s — a) 
k=0 


The following result establishes subadditivity of D(s, A, 4) as a function 
of (s, A) for each fixed yp. Note that 


[Ai/(A1 + A2)]D(s1, Ai, w) + [A2/(Ai + Az) ]D (s2, Az, b) 


is the overall average delay experienced in the separate systems 
because A;/(A; + Az) is the long-run proportion of customers to enter 
the first system. 

Theorem 2: For all positive integers s; and sz and all positive real 
numbers i, d2, and p, 


D(s) + S2, Ay + Xa, B) 
= [Ai/ (i + Ar) JD(si, Ai, Ht) + [Az/(Ay + Az) ]D(s2, A», Lt). (4) 


This yields immediately that 


a 3 si, >, «) = }) H(si, ai) 
i=) isl i=1 


for each integer n, where H(s, a) = aC(s, a)/(s — a), which is the 
version of Theorem 2 actually conjectured by Arthurs and Stuck.’ 

In order to prove Theorems 1 and 2, we found it convenient to prove 
stronger results. It is helpful to see how the loss rate L(s, A, u) and the 
mean delay D(s, A, ») are related to the steady-state number of 
customers in the system, say Q. In the M/M/s loss system 


L(s, A, w) =A — pEQ (5) 


because A is the arrival rate and pEQ is the service-completion rate; 
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the loss rate is that fraction of the arrivals that are not served. In the 
M/M/s delay system 
A(D(s, A, w) + pw’) = EQ (6) 
by virtue of the fundamental relation L = \ W; see Stidham.° 
Let Q; be the steady-state number of customers in the ith system 


(t = 1, 2) and let Q be the steady-state number of customers in the 
combined system. Then Theorem 1 is equivalent to 


EQ = EQ: + EQ: (7) 
for the loss systems, and Theorem 2 is equivalent to 
EQ = EQ, + EQ: (8) 


‘for the delay systems. 

Instead of comparing the means in (7) and (8), we prove Theorems 
1 and 2 by making more general stochastic comparisons. We do this in 
two different ways. Our first method of proof is to compare the 
distribution of Q with the distribution of Q, + Q». It turns out to be 
very easy to establish an appropriate ordering for the entire distribu- 
tions, which in turn implies the desired inequality for the means. The 
appropriate order is the monotone likelihood-ratio ordering. We define 
this ordering and prove the more general theorems implying Theorems 
1 and 2 in Section II. 

Our second method of proof is to compare entire stochastic processes 
rather than just stationary distributions. As corollaries we obtain 
stochastic-order relations for the stationary distributions which in turn 
also imply the desired inequalities (7) and (8) for the means. This 
approach has the advantage that the arrival processes can be arbitrary 
rather than Poisson and the service-time distributions can be general 
instead of exponential. The argument is also remarkably simple. The 
idea in this approach is to construct artificially the two stochastic 
processes being compared on the same probability space. The construc- 
tion is carried out so that each stochastic process individually has the 
correct distribution (family of finite-dimensional distributions) as orig- 
inally specified. We choose a special joint distribution so that each 
sample path of one process always lies below the corresponding sample 
path of the other process. Because the construction is artificial, the 
joint distribution of the two processes is not directly meaningful, but 
it implies a strong stochastic ordering for the processes. Such special 
constructions have been used previously to compare queuing processes; 
see Sonderman,’ Whitt,® Wolff? and references there. In fact, the 
generalization of Theorem 2 is a direct consequence of Wolff’s theorem 
and the other proofs involve similar reasoning. We present our results 
using this approach in Section III. 
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In Theorems 1 and 2 we assume equal service rates in the two 
systems. It is natural to ask whether extensions of (2) and (4) hold 
when the service rates are unequal. In Section IV we show that, with 
unequal service rates, combining resources need not be more efficient; 
in fact it can substantially degrade performance. Infrequent “bad” 
customers from one system can adversely affect a large number of 
“good” customers from the other system. 


ll. MONOTONE LIKELIHOOD-RATIO COMPARISONS 


Let X and Y be random variables assuming values in the nonnegative 
integers. We say X is less than or equal to Y in the monotone 
likelihood-ratio ordering and write X s, Y if 


PXX=k+1)_ PY =k +1) 
P(X=k) ~~ P(Y=hk) 


for all integers k; see page 208 of Ferguson.'” We say X is stochastically 
less than or equal to Y and write X <,, Y if Ef(X) = Ef(Y) for all 
nondecreasing real-valued functions f for which the expectations are 
well defined. Obviously, EX = EY whenever X S,, Y. What is important 
for Theorems 1 and 2 is that X =, Y implies X s,, Y. This is well 
known and not difficult to show. In fact, the monotone likelihood-ratio 
ordering is equivalent to stochastic order for all conditional distribu- 
tions obtained by conditioning on subsets, ie., E(f(X)|X € A) S 
E(f(Y)| Y € A) for all subsets A and all nondecreasing real-valued 
functions f; this property is discussed in Whitt’?”; see Keilson and 
Sumita”™ for additional material. 

Returning to the notation of (7) and (8), we obtain the following 
results which imply Theorems 1 and 2. 
Theorem 3: For the M/M/s loss systems, 


Qi + Q2 <, Q. 
Theorem 4: For the M/M/s delay systems, 


Q <, Qi + Qe. 


Theorems 3 and 4 can each be proved by simple calculations since 
the stationary distributions are known and easy to work with. To 
illustrate, we do one proof. 

Direct Proof of Theorem 3: Let a; = di/pi for i = 1, 2. Then, using 
convolution, we obtain for some constant C 
P(Qi+Q2=kR+N=C YY aita?/istie! 
OS1)<s 


; OSIg=s2 
iy +ig=k+1 


(9) 


RESOURCE SHARING IN TRAFFIC SYSTEMS 43 


Ca, 





> a} ai/ (is = 1)!z.! 





k+1 1st)<s, 
OSto<s9 
ijtig=k+1 
Caz - piece 
+ a}as?*/i!(i2 — 1)! 
h+1 ccs fas /t! (te ) 
1<ig=so 
a, + a2 Bo Oo ads ahi 
< {———_]C 1a5?/1y!te! 
( hel coe, ayvas?/ty!t2 
OSiy<s9 
i,+ig=hk 
P(Q =k +1) 
= —————— P(Q + Q2 = k). 
POam Pie + Q=h) i 


It is also significant that both Theorems 3 and 4 can be viewed as 
trivial corollaries of a more general theorem. This more general theo- 
rem is especially useful for comparisons when the limiting distributions 
are not known. To state our general result, consider two stochastic 
processes on the integers, Y:(¢) and Y2(t), that move only by jumps up 
or down in unit steps to one of the neighboring states. Let all the 
transitions be governed by birth-and-death rates, but in contrast to 
those in birth-and-death processes, these rates may depend on infor- 
mation other than the current state such as the history of the process 
or other relevant variables. Let A;(k, I:) and pi(k, I:) be the birth-and- 
death rates, respectively, for the ith process (i = 1, 2) in state k with 
additional information J; at time ¢. By having transitions governed by 
birth-and-death rates, we mean that 


P(Yi(t +h) =k+1| Yilt) =k, I) = hat, I) + o(h), 
P(Y(t +h) =k—-1| Yilt) =k, I) = hut, I) + ofA), 
and 
P(Yi(t + h) =k| Yt) =k, I) =1— Aft, [) + wilt, 1)] + ofA), 


where o(h) means a quantity that converges to zero after division by 
h as h > 0. Let X; and X, be random variables with the limiting 
distributions of these two stochastic processes, which we assume exist 
as proper distributions. Here is our general monotone likelihood-ratio 
comparison result. 

Theorem 5: Consider the processes Y(t) and Y,(t) defined above. 
Suppose there exist sequences of constants {a(k))} and {B,(k))} such 
that 


A(R, I) Salk), alk, Lr) = a2(k), 
palk, I) = Bi(k), p(k, Ii) = B2(k), 
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for all k and I;. If ai(k)/Bi(k + 1) S ac(k)/B2(k + 1) for all k, then 
Xi =, Xo. 
Corollary: If 


Ai(k, Lr) S Ao(k, It) 
and 
balk, I;) = polk, It) 


for all k, I,, and Ii, then X, =, X2. 

Proof of Theorem 5: Look at the stationary flow between states k and 
k + 1. The flow from k to k + 1 is less than or equal to P(Xi = k)ai(k) 
for process 1 and greater than or equal to P(X2 = k)ae(k) for process 
2. Similarly, the stationary flow from k + 1 to & is greater than or 
equal to P(X; = k& + 1)f:(k) in process 1 and less than or equal to 
P(X2 = k + 1)B2(k) in process 2. Since the stationary flow from k to 
k + 1 must equal the stationary flow in the opposite direction, 


P(X = k)aa(k) = P(X =k + BAR + 1) 
and 

P(X2 = k)axlk) S P(X. =k + 1)B(k + 1). 
Consequently, 


P(X; =k + 1) 2 ai(k) 2 a2(k) - P(X, =k + 1) q 
P(X, = k) Bik +1) B.(k + 1) P(X2 = k) 
We can now apply the corollary to Theorem 5 to prove Theorems 3 
and 4. 


Second Proof of Theorem 3: Note that the processes depicting the 
number of customers being served satisfy the hypotheses of Theorem 
5. In the case of two separate facilities, the sum is not a birth-and- 
death process because the rates depend not only on the total number 
but how many are in the individual facilities. When k customers are 
present, the death (service) rates are identical, but the birth (arrival) 
rates can be higher in the combined system because if one of the 
separate facilities is full, then it cannot accept any more arrivals. 
Hence, the hypotheses of the corollary to Theorem 5 are 
satisfied. O 

Proof of Theorem 4: Again we apply the corollary to Theorem 5. The 
reasoning is similar except here when & customers are present, the 
birth (arrival) rates are always identical, but the death (service) rates 
can be less with the separate facilities because there can be idle servers 


in one facility while there are customers waiting in the other 
facility. O 
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_Ul. SAMPLE PATH COMPARISONS 


Let (X(t), = 0} and { Y(t), t= 0} be real-valued stochastic processes. 
We call a real-valued function f defined on the space of all sample 
paths of X(t) and Y(¢) nondecreasing if f({x(¢), f= 0}) = f({y(d), t= 
0}) for all sample paths {x(t), ¢ = 0} and {y(é), ¢ = 0} such that x(Z) 
<= y(t) for all t = 0. We say the stochastic process {X(t), t = 0} is 
stochastically less than or equal to the stochastic process { Y(t), t= 0} 
and write {X(t), t= 0} <w (Y(t), t = 0} if f({X(@, t= 0}) su f((Y(O, 
t = 0}) for all nondecreasing real-valued functions f defined on the 
sample paths of X(t) and Y (¢). Clearly, stochastic order of the processes 
implies X(t) ss: Y(t) for each ¢t [just use the projection: f({x(u), u = 
0}) = x(é)], but it is much stronger, applying to many other nonde- 
creasing functionals. In fact, since the queuing processes have sample 
paths with left and right limits everywhere, stochastic order of the 
processes is equivalent to stochastic order for all finite-dimensional 
(joint) distributions; see Section 4 of Kamae, Krengel, and O’Brien." 
Moreover, stochastic order of the processes here is equivalent to the 
possibility of a strong sample-path comparison. In particular, 


{X(t), = 0} Ss {Y(é), t= 0} 


holds if and only if it is possible to construct stochastic processes 
{X(t), £ = 0} and {Y(t), £ = 0} on a common probability space such 
that {X(t), ¢ = 0} has the same distribution as {X(¢), t = 0}, 
{Y(t), t = 0} has the same distribution as { Y(t), ¢ = 0}, and every 
sample path of {X(t), t = 0} lies below the corresponding sample path 
of {Y(é), t = 0}; see Theorem 1 of Kamae, Krengel, and O’Brien.“* 
What we do is apply the easy half of this equivalence—the fact that 
the sample-path construction implies stochastic order—to make sto- 
chastic comparisons between the queuing processes. The proofs here 
are done by actually constructing processes with the sample-path 
ordering. Previous uses of such constructions appear in Sonderman,’ 
Whitt,’ Wolff? and references therein. The approach is also closely 
related to the so-called “coupling” techniques; see Lindvall’ and 
references therein. 

We begin with the generalization of Theorem 2 for delay systems 
because it follows directly from Wolff.? As before, we assume the FcFS 
discipline, but now we allow the arrival streams in the two separate 
systems to be arbitrary. We assume the service times are independent 
of the arrival processes and mutually independent and identically 
distributed, but they need not be exponentially distributed. Since the 
arrival process is assumed to be independent of the service-time 
sequence, the evolution of the arrival process cannot depend on the 
state of the system. This excludes finite-source models, for which 
counterexamples to the efficiency of sharing are easy to construct; for 
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example, see page 1377 of BeneS.’® Let Q;(t) be the number of customers 
in the ith system and let Q(t) be the number of customers in the 
combined system at time ¢. 

Theorem 6: (Wolff) If Q:(0) = Q2(0) = Q(0) = 0, then {Q(é), t = 0} 
Sse (Qilt) + Qo(t), f= 0}. | 

Remarks: (i) Wolff® was actually interested in comparing the FcFS 
discipline with the cyclic assignment discipline in a single delay-system. 
He showed that the queue length process with the FcFs discipline is 
stochastically less than the queue length process in the same system 
with any other discipline. This result applies here because the two 
separate facilities can be interpreted as a single system with a special 
queue discipline: Just label the arrivals in the special system according 
to the stream from which they came and then assign them according 
to the FcFs discipline to one of the servers in the corresponding 
subgroup of servers. 

(tt) We can obtain corresponding results if the systems are not 
empty initially. For more general initial conditions, we can assume 
appropriate stochastic order for the residual service times at ¢ = 0. 

(iii) Wolff® also obtained similar comparison results for other pro- 
cesses, all of which hold here too: the departure epochs, the number of 
customers in queue, the total work (in service time) in the system, and 
the total work in queue. By the sample-path construction, the sto- 
chastic order jointly holds for all these processes. See Theorem 8 here. 

(tv) As a consequence of Theorem 6, Q(t) Ss: Qi(é) + Qe(t) for each 
t. With the general assumptions here, steady-state distributions need 
not exist, but if @,(¢) and Q(¢) converge in law to Q; and Q, respectively, 
as t > o, then Q@ <u Qi + Q2; see Proposition 3 of Kamae, Krengel, 
and O’Brien.“* The convergence of course holds in the setting of 
Theorem 2, so Theorem 6 implies (8) and thus Theorem 2. 

(v) Since Theorem 2 concerns the mean-waiting time, it is natural 
to ask if the steady-state waiting-time distribution is also stochastically 
less in the combined system. Unfortunately, in general it is not. The 
counterexample in Whitt” applies here too; the cyclic discipline there 
can be interpreted as arrivals to separate facilities. 

(vi) When the arrival streams are not Poisson, which we now permit, 
a new phenomenon occurs. Then the customers in the different streams 
experience different congestion when the systems are combined, even 
if the service times are independent and identically distributed. This 
phenomenon can be an important consideration in combining systems, 
but we do not consider it here; it has been studied by Kuczura.'*”” 

We now turn to our generalization of Theorem | for loss systems. In 
addition to allowing arbitrary arrival streams and general service-time 
distributions, we allow a finite waiting room. The number of waiting 
spaces in the combined system is the sum of the numbers of waiting 
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spaces in the separate systems. Let N,(t) [N(¢)] be the number of 
customers lost in the interval (0, ¢) in the ith separate system (in the 
combined system); let S;(¢) [S(¢)] be the number of service completions 
in the interval (0, ¢) in the ith separate system (in the combined 
system); and let C;(t) [C(¢)] be the amount of work performed—service 
given—in the interval (0, ¢) in the ith separate system (in the combined 
system). 

Theorem 7: If Q:(0) = Q2(0) = Q(0) = 0 in these systems with finite 
waiting rooms, then 


{N(t), £= 0} Sw {Nilt) + N2(t), f= 0}, 

(S(t), t= 0} Sw {Si(t) + S2(t), t= 0}, 
and 

(C(t), t= 0} See {Cilt) + Cr(t), t= 0}. 


Now assume that N,(¢)/t and N(t)/t converge (either in probability 
or with probability one) as t > o. Let the limits be denoted L(s,, ,, 
Aj(t), F) and L(s; + Se, ki + Ro, Ai(t) + Ao(t), F'), respectively, with k; 
denoting the number of waiting spaces, A,(¢) the arbitrary arrival 
process and F(x) the general service-time c.d.f. From Theorem 7 we 
immediately obtain the following generalization of Theorem 1. 
Corollary: For all positive integers si, s2, ki and ke; all arrival 
processes A;(t) and A2(t); and all service time c.d.f.'s F(x) such that 
the loss-rate limits exist, 


L(s1 + S2, ki + Re, Ai(t) + Ao(t), F) 
<= L(si, ki, A; (2), F) + L(s2, ko, A2(t), F). 


To prove Theorem 7, we establish a finite-waiting-room generaliza- 
tion of Wolff’s’ comparison theorem. Following Wolff, we shall state 
the result in terms of the sample-path comparison. Since the joint 
distribution of the two systems being compared is artificially obtained, 
the appropriate conclusion is the general stochastic order as in Theo- 
rems 6 and 7. 

We carry out the artificial construction by letting the systems being 
compared have identical arrival processes and service times. Note that 
we are now focusing on a single (arbitrary) sample path. We let the 
nth service time vu, be associated with the nth customer to enter 
service in each system rather than the nth arrival. Let a, be the arrival 
epoch of the nth arrival, 0 <= a; S a, = ---. We assume there are s 
servers operating in parallel and k extra waiting spaces in both systems. 
We also assume the systems are initially empty. 

One system, called the original system, will be the conventional 
system where the servers are fed by a single queue using a FCFS 
discipline. Moreover, there are k extra waiting spaces and arriving 
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customers enter the system if the number of customers in the system 
is less than s + k, and are lost otherwise. 

The other system, called the modified system, is any alternative to 
the original system which assigns customers to servers in some manner, 
independent of the sequence of service times {v,}, and which loses 
arrivals whenever the system is full and in some manner otherwise. 

Let a, be the arrival epoch of the nth customer. For the original 
system, let t, be the time that the nth customer to enter the system 
enters; obviously ¢, = a, for some k, k = n. Also, for the original 
system, let b, be the time the nth customer to begin service begins and 
let d, be the nth ordered departure epoch from the system. Let ap, th, 

n, and d;, be the corresponding quantities for the modified system. 
Theorem 8: For all integers n, tn S th, bn S bn, and d, = dh. 
Proof: The sets of unordered departure epochs in the two systems are 
clearly {(b, + v,)} and {(b;, + v,)}, respectively. For the original 
system, 
d, = min {6; + uj} = min {¢; + vj}, 


l<i<s i<iss 


d,, = nth-order statistic from {(b; + ui): 1<n + s} (10) 
and 
bn = max{tn, dna), n=l, (11) 
where d; = 0 if 7 = 0. For the modified system, 
d;, = nth-order statistic from {(b; + vj): i1<n +s} (12) 
and 
bh = max{tn, dis}, n=l, (13) 


because in the modified system it is possible to have a positive queue 
and an idle server. 

Since the nth-order statistic is a monotonic function, to prove 
Theorem 8 it suffices to show that ¢, = ¢, and b, = 6, for all n. We 
show this induction. Obviously 6; = ¢; = a; S tis bj, 11 < s. Suppose 
t; = tj; and 6; = 0b; for all 14, i = n — 1. We first show that ¢, < th. 
Suppose not; then 


th>t,2=thiz= En-1, 


and thus n — 1 customers have entered both systems before the arrival 
associated with ¢,. (Note that customers could arrive in batches, i.e., 
Qe = Qp+1 iS a possibility, but this presents no serious difficulty.) 
However, by the induction hypothesis b; + v; = bj + u;, 1S n — 1, so 
the original system has at most the same number of customers as the 
modified system before the arrival associated with ¢;,. Thus, t, > t’n 
cannot occur. Hence ¢, < ¢;, as claimed. 
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To continue the induction proof for b,, note that (10) and (12) imply 
that d; <= d; for i =n — s. Then, from (11) and (13), we have 


bn = max{tn, dns} S max{t;, dr_-s} = bn, 


which completes the proof. CL] 

Remark: Our proof of Theorem 8 is closely related not only to Wolff’s 
proof,’ but also to Sonderman’s comparison proofs.”””’ Sonderman was 
concerned with the effect of different service-time distributions instead 
of different queue disciplines. 

We close this section with another result about pure-loss systems. 
With waiting rooms or with general service times it is easy to show 
that the stochastic processes representing the number of customers in 
the system need not be stochastically ordered, but we do get stochastic 
order with exponential distributions and no waiting rooms. 

Theorem 9: In the setting of Theorem 7, if there are no waiting rooms, 
if the service-time distribution is exponential and if Q(0) =ste Qi(0) + 
Q2(0), then 


{Q(t), t = 0} Sse (Qil(t) + Qo(t), £ = 0} 
and 
{N(t), t= 0} Sse {Ni(t) + No(t), t = O}. 


Proof: Here the argument follows Sonderman’”*”' and Whitt.’ As the 
first step in constructing the two systems on the same probability 
space, we let the two systems being compared have identical arrival 
processes; i.e., we let the arrival process to the combined group of s; 
+ s) servers be the sum of the two arrival processes to the separate 
groups of s; servers. This not only means that the arrival processes 
have the same joint distributions, but that they have the same sample 
paths. Similarly, we let both systems start off with the same number 
of customers in the system; i.e., given the pair [@:(0), Q2(0)], we let 
Q(0) = @1(0) + @2(0). We now show how to construct the departures 
so that 


N(t) = Ni(t) + No(t) (14) 
and 
Q(t) = Qilé) + Qe(t) (15) 


for all t= 0. We generate departures from both systems using a single 
Poisson process with rate (s; + sz). Each point in this Poisson process 
corresponds to a potential departure. Suppose the point occurs at time 
t. With probability Q,(t)/(si + s2), the point corresponds to a departure 
from both the single group of si servers and the combined group of s; 
+ So servers; with probability Q2(t)/(si + s2), the point corresponds to 
a departure from both the single group of sz servers and the combined 
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group of s; + sp servers; with probability [Q(t) — Qi(é) — Qe(t)]/(si + 
S2) the point corresponds to a departure from only the combined group 
of s; + Se servers; and finally, with probability [s; + s2 — Q(t)]/(si + 
S2), the point corresponds to no departure at all. This can be shown to 
yield the proper distributions for each system; see Sonderman’ for 
more detail. This also guarantees that there is a departure in the 
combined group of s; + Ss. servers whenever there is a departure from 
one of the groups with s; and so servers. There also cannot be a 
departure from the combined group alone when Q(t—) = Q,.(t—-) + 
Q2(t—), so inequality (15) is maintained. This means that all departures 
and losses from the combined group of s; + s2 servers that are not 
matched by corresponding departures or losses from one of the groups 
of s) and se servers can be matched with earlier losses from one of the 
groups of s; and s2 servers. Mathematical induction on the arrival 
index establishes (14) and (15) and formally completes the proof. O 
Remark: For the special case of M/M/s systems, the stochastic order 
in Theorems 6 and 9 can also be established under the conditions in 
the corollary to Theorem 5 using existing comparison theorems for 
continuous-time Markov chains; see Sonderman.’ However, we know 
of no direct connections between the monotone likelihood-ratio order- 
ings and the sample-path orderings. 


IV. DIFFERENT SERVICE RATES 

In this section we let the service rates in the two separate systems 
be different. One way to extend (2) and (4) occurs when the service 
times are associated with the arrivals. If two independent Poisson 
streams with rates A; and A2 and associated service-time c.d.f.’s Fi(x) 
and F(x) are combined, then the resultant stream is a Poisson stream 
with rate A, + Az and associated service-time c.d.f.: 


F(x) = [Ai Fi(x) + AoF2(x)]/(A1 + Az). 


Of course, when F;(x) is exponential with mean pj" for each i, F(x) is 
not exponential unless 41 = pe. However, the blocking probability for 
an M/G/s loss system depends only on the mean service time. Thus 
the loss rate for the combined system is L(s; + se, Ai + Ae, (Ai + A2)/ 
(a, + a2)), and a natural extension of (2) to conjecture is 


L(s, + S2, Ai + Az, (Ar + Az) /(@Qi + @2)) S D(si, Ar, pi) + L(Se, Az, pe). 


Unfortunately, this conjectured inequality is not valid. To see this, let 
Ai = 1, pi = €7', Ao = €, and po = €”; then a; = € and a2 = €~'. Obviously, 


L(s1 + S2, Ar + Aa, (Ar + A2)/(a1 + a2)) = (Ai + Az) B(si + S2, a1 + a2) 
= (1 + e)B(si + S2, € + e') 


—1 as e—0, 
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whereas 
L(s1, Ai, pa) + L(se2, Az, pe) = Ai B(si, ai) + A2zB(se, a2) 
B(si, €) + €B(s2, €~*) 


—0O as e—0. 
Consequently, in this case 
L(s1 + S2, Ai + As, (Ar + A2)/(aQi + @2)) = L(si, Ar, pr) + L(se, Az, pe) 


for sufficiently small «. The previous measure, rate of customer loss, is 
not the only reasonable way to evaluate system performance in this 
case. For example, one might be interested in the rate of loss of service 
time. (Note that there is no real difference between these measures 
when the mean service times of the systems are identical.) With this 
new measure, the natural extension of (2) to conjecture is: 


Qi + Qe 


7s rf 2 L(s; + S2, At + Ao, (Ai + A2)/ (ai + a2)) 





1 1 
=— L(si, Aa; Li) Fed L (ss, da; fle). 
Bi be 


This inequality is in fact always true, since substitution of L(s, A, p) 
= \ B(s, A/p) quickly reduces it to the second version of the inequality 
of Theorem 1. Thus the server occupancy is always increased for the 
combined system. 

Turning to delay systems, we again find examples where sharing can 
be counterproductive. To see this, consider two M/M/1 delay sys- 
tems with A; = 1, pi = 2, Ao = €, and pe = 2e. Then EQ,(%) = HQs(o) 
= p/(1 — p) = 1, but EQ(~) can be shown to be of order €~’ as € > 0: 
Consider the interval following a low-intensity arrival. With probability 
Ao/(Az + pe) = %, a second low-intensity arrival occurs before the first 
departs. Then there follows an exponentially distributed interval of 
mean length 1/4¢ during which the combined system fills up with high- 
intensity customers. In computing the average number of customers 
in the system, we get a term of order €~” (the total area in the plot of 
the number of customers in the system versus time, starting from the 
moment the second low-intensity customer arrives and ending when 
one of the two low-intensity customers departs), divided by a term of 
order e~’. In other words, with the mean steady-state delays held fixed 
in the two separate systems, the mean steady-state delay in the 
combined system can be arbitrarily large. 

Note that the combined system can be modeled as an M/G/s+s2 
delay system where the service-time distribution is the mixture of two 
exponential distributions, but in contrast to the case of loss systems 
the mean delay does not depend only on the mean of the service-time 
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distribution. Hence, the appropriate generalization of (4) involves a 
system which is not M/M/s. 

Another possible extension for pi 4 2 occurs when the service-time 
distributions are associated with the servers. Here the combined 
system is not M/M/s because there are heterogeneous servers, so there 
are no equations similar to (2) and (4). In this case, it can be shown 
that with exponential service-time distributions and no waiting rooms, 
resource sharing is always better if customers always are sent to the 
fastest available server. In particular, as in Theorems 6 to 8, it can be 
shown for any single system that assigning customers to the fastest 
available server produces fewer losses than any other rule, where by 
“fewer losses” we mean in the sample-path ordering of Section III. One 
other rule, corresponding to the two separate systems, is to assign the 
customer only to servers associated with their original separate arrival 
streams. 

When we focus on delay systems with heterogeneous servers, it is 
easy to give counterexamples showing that resource sharing can again 
be counterproductive. Related literature on the assignment of cus- 
tomers to heterogeneous servers appears in Winston,”* Smith,” and 
references therein. 

This section shows that, with unequal service-time distributions, 
resource sharing can be counterproductive. However, with unequal 
service-time distributions, much depends on the criterion of system 
performance. Also, it should be noted that such counterexamples have 
been observed before; others have discovered that infrequent “bad” 
customers can affect adversely a large number of “good” customers. 
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APPENDIX 


Here we give two results that are due to others. First, we present Paul 
Burke’s proof that B(ts, ta) is strictly decreasing in ¢ for f = 0. This 
result implies Theorem 1 when Ai/si = dAo/s2, because then 
B(s,: + Se, Qi + de) = B(ts;, ta;) for some t = 1, so B(si, ai) = 
B(s, + Se, ai + a2) for each i and 


Ai B(si, a1) | A2B(se, az) 


pe pS Be oer ee 


which is equivalent to (2). 
To see that B(ts, ta) is strictly decreasing in ¢, first recall the 


following equation relating two different expressions for the tail of the 
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gamma distribution: 





k=n a ie 4 
= 2 xe 
e“ YY a*/ki= — dx. 

k=0 " n: 


Cc 
| e-*x" dx 
1 ta 


Bits, ta) = e “(ta)” 


Then note that 


= | e~ "(x /ta)™ dx 


= | e-*(1 + [x/ta])" dx; 


also see Theorem 3 of Jagerman.* Finally, (1 + [x/ta])* is strictly 
increasing in ¢. 

Second, Herbert Shulman has shown that Paul Burke’s result and 
the convexity of B(s, a) in s for s = 1 imply a version of Theorem 1. 
Such convexity has frequently been conjectured but has been proved 
only for lattices of points with unit spacing, see Messerli” and refer- 
ences therein. These versions of convexity are not strong enough to 
make the following valid even when s; and s2 are integers; however, 
general convexity would establish the proof for all real numbers s; and 
S. = 1, a more general mathematical result. We reproduce Shulman’s 


argument here: 
a a; + a2 
B(s; + Se, Qi + az) <ta(S2* 
a, + a2 Q\ 


a» a(* + A» 


8}, Ay + a) 


— S2, A) + Q2 
a, + ae ay 
ay a2 
=— B(s, a1) + ——— B(sz, az). 
a; + a a; + a2 
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Transient Behavior of the Kendall Birth-Death 
Process—<Applications to Capacity Expansion 
for Special Services 


By R. N. NUCHO 
(Manuscript received October 16, 1979) 


In this paper we derive explicit expressions for the transient state 
probabilities of the Kendall birth-death process, with and without 
immigration, for any initial condition. We then propose this process 
as a model for special services point-to-point demand, in which the 
births represent circuit “connects” and the deaths represent “discon- 
nects.” This choice of model is based on intuitive arguments and on 
the fact that the model can represent the growth and turnover 
characteristics of special services demand. Thus, the model provides 
a means by which special services demand, with its inherent uncer- 
tainty, may be approximately represented in various facility network 
studies, to obtain, at the very least, useful qualitative results. In 
particular, we evaluate the probability of a held order (i.e., the 
probability that a service request is held for lack of spare facilities) 
with Blocked Customers Held (BcH) as the queue discipline. We also 
apply the model to capacity expansion problems, introduce the con- 
cept of margin, the extra capacity needed to meet the demand within 
a given held-order probability, and examine its sensitivity with re- 
spect to growth, turnover (or churn), and system size. We find that 
aggregating small demands into a single larger demand produces 
significant reduction of the margin, because of improved statistical 
properties. 


1. INTRODUCTION 


In this article, the transient behavior of the Kendall birth-death 
process* with immigration is examined, and some applications of the 


* The Kendall birth-death process is one in which the transition rates are proportional 
to the state. 
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process to capacity expansion problems are discussed. The choice of 
such a process was motivated by the search for a model for special 
services point-to-point circuit demand, a model which would be used 
as a tool for determining facility network circuit routing strategies. 
Special services demand generally consists of demand for full-time 
dedicated circuits (e.g., foreign exchange lines, waTs lines, data lines), 
as opposed to the message-traffic offered load which consists of de- 
mand for the use of common facilities for a relatively short period of 
time. Thus, the system examined is characterized by the stochastic 
process .(¢) with realizations (states) n = 0, 1, ---, 0, where n might 
refer to the number of working circuits or some other facility, rather 
than to the number of busy trunks, as in the message-traffic case. By 
definition, the birth-death process’ allows transitions from some state 
nton-+ 1 viaa birth (circuit connect), or to m — 1 via a death (circuit 
disconnnect). The transition rates are A, for the births and p, for the 
deaths, both of which are chosen proportional to n for the following 
reasons. 

It is clear, for special services, that the rate of disconnects, pp, is 
state dependent. There are, in fact, indications’ that pi, is a monoton- 
ically increasing function of n. The simplest such function is ny, which 
implies that the probability of disconnects is proportional to the size 
of the system. With this choice for the death rate, a number of possible 
choices exist for the birth rate. Choosing it to be a constant causes the 
mean number of circuits to saturate in time, while choosing it to be 
proportional to n causes the mean to grow or decline exponentially. 
Since special services are presently characterized by significant net 
growth, it would seem that a plausible model for special services 
demand is a birth and death process in which both the birth and death 
rates are proportional to the state. 

One consequence, however, of assuming A,, = nA is that if the process 
reaches the state n = 0 at any time, by a succession of disconnects, it 
will stay there forever, since the birth rate is zero. To eliminate this 
characteristic, the concept of immigration may be introduced by taking 

n = nd + B, where B is the immigration factor. The cases with and 
without immigration will be discussed below. 

It must be emphasized that it is not the intent of this paper to 
validate the model based on an examination of actual special services 
data. Such statistical data analysis is important for a final assessment 
of the accuracy of the model and is currently being undertaken. For 
the purposes of this paper, it shall be assumed that a study of the 
proposed model is justified, based on the intuitive arguments given 
above and on the fact that the model captures the growth and turnover 
characteristics of special services (see Section 5.1). The model provides 
a means by which special services demand, with its inherent uncer- 
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tainty, may be approximately represented in various special services 
facility network studies, to obtain, at the very least, useful qualitative 
results. 

Since the situation of interest is that of net growth, it is clear that 
statistical equilibrium does not exist, and that it is the problem of the 
transient solutions of the Kolmogorov birth-death equations that is of 
prime importance. Much literature exists on the subject of transient 
solutions for birth-and-death processes’*""' and the case in which the 
transition rates are state independent is completely solved.”® The case 
in which A, and yp, are proportional to the population is solved when 
immigration is not included: The results for a specific initial condition, 
namely starting from the state n = 1, are derived in Refs. 4 and 10 and 
the expressions for the general initial condition are quoted in Ref. 10. 
For the nonzero immigration case, the form of the generating function 
for the state probabilities is known,’° but it seems that explicit expres- 
sions for the state probabilities have not previously appeared in the 
literature. In this paper, these expressions are derived for any non- 
negative value of £. 

In special services, if an order for service is delayed because of lack 
of spare facilities, the order is said to be held. Thus, in order to study 
capacity expansion problems, the probability of a held order is intro- 
duced, as well as the concept of margin, the extra capacity needed to 
meet the demand within a given held-order probability. This held- 
order probability is similar but not identical to the transient time 
congestion of the process (see Appendix B). The queue discipline 
followed here is Blocked Customers Held (BcH), in which an arriving 
customer spends a total time T (random variable) in the system, after 
which he departs regardless of whether he is waiting to be served (i.e., 
his service order has been delayed) or is actually being served (i.e., he 
has been assigned a circuit). 

A fundamental difference between this analysis and teletraffic must 
be emphasized. This difference arises because of the respective time 
scales in the two cases. Whereas the mean lifetime of a call in traffic 
(t = 1/p) is of the order of a few minutes, the mean lifetime of a circuit 
in the process described here is of the order of a few years. It is this 
fact, coupled with the relatively fast growth of special services demand, 
that makes it impossible to even approximately treat the process in a 
statistical equilibrium mode (no growth) with a slowly varying enve- 
lope representing the growth. Thus, the transient aspect of the problem 
is to be contrasted to the more conventional assumption, in teletraffic 
theory, that statistical equilibrium prevails (it must be mentioned, 
however, that some work has been done concerning nonstationary 
telephone traffic with time-varying Poisson-offered load, e.g., Refs. 12 
to 14). It must be further noted that, although the model is being 
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proposed for special services demand, nevertheless, it may be applied, 
with an appropriate choice of parameters A, 8, and p, to any process 
that behaves in a similar manner. 

The held-order probability having been defined and the concept of 
margin introduced, questions concerning capacity expansion problems 
are addressed. Capacity expansion is a problem that has been studied 
by many. In this paper, optimal capacity-expansion policies are not 
sought; only very specialized aspects of the problem are considered. 
For instance, the effects of aggregating demands into a larger single 
demand are examined, and the minimum capacity increment which 
would meet the demand within a specified interval of time and within 
a given held-order probability is determined. In addition, the relation- 
ship between spare capacity and lead time is discussed (see summary 
of results in Section II). Some relevant work has been done by 
Freidenfelds’*”® in which the author computes first-passage times to 
various levels of demand using a general birth-death process, and 
discusses briefly fill-at-relief problems. Work by Luss and Whitt!’ 
studies the impact of both deterministic and stochastic models on 
utilization. The authors use Brownian motion to model the stochastic 
demand and follow a scheme similar to ours for determining the margin 
needed at a future time. 

The organization of this paper is as follows. Section II sets up the 
problem and gives a summary of results. The explicit solutions for the 
general case are derived in Section III, and their properties are exam- 
ined in Section IV. In Section V, growth, turnover, and churn are 
defined, the concept of margin is introduced, and some of its applica- 
tions to capacity expansion problems are discussed. Finally, Section 
VI contains the conclusions. 

ll. BACKGROUND AND SUMMARY OF RESULTS 
2.1 General birth-death equations 

Consider a system described by a set of states n = 0, 1, ---, 0, and 
a birth and death process defined by a set of transition rates {An, Un}. 
The quantity 4,,6(,6) + o(6)* is the probability of a birth (death) in 
the small interval [¢, ¢ + 6], given that the system is in state n at time 
t.' The probability of more than one birth or death in [¢, ¢ + 4] is 0(6). 
The probabilities P,,(¢) of finding the system in state n at time ¢ must 
satisfy the well-known infinite set of difference-differential equations 
(p. 454 of Ref. 1) 


5 Patt = (An es Hn) P(t) + An-1 n—1(t) a Pn41Prsi(t) 
[for n=O, P-i(t)=0, wo=O]. (1) 


* o(-): R' > R' is such that lim ols) = 0. 
eo 6 
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If the initial number of circuits is no, the initial condition may be 


written 
P,,(0)= Snnos (2) 


where dnn, is the Kronecker delta. 

The particular birth-death processes considered in this paper are 
the cases in which the transition rates are proportional to the popu- 
lation, n, with or without immigration.**”° The corresponding transi- 
tion rates, defined for all nonnegative integers, n, are 


An = nd + B, [in = NB, (3) 


where A, p, and £ are nonnegative constants. In the following sections, 
results for the case with no immigration may be easily obtained by 
setting 8 = 0. 


‘2.2 Mean and variance 


It has been shown’*”° that the mean, m/(t), and the variance, u(é), of 
processes such as those described by eggs. (1) and (3) may be obtained 
without solving explicitly for the P,(t). The resulting expressions, 
satisfying initial condition (2), may be easily found to be 

(t) CaseX ~ ps 


_ B Apt __ B 
mit) = (n + 7 -)e H ear (4) 
v(t) = no A +p eT] — e O47 

A— py 

+E Nem + he (5) 
(it) CaseA = p 

m(t) = Bt + no, (6) 
v(t) = ABE? + (2Ano + B)t. (7) 


2.3 Solutions for P,(t) 
To simplify the notation, define the following quantities: 


A=A~4, 
2 At 
A=A(t) ee 
At __ 
B= Bi) er 
C=C “9 (8) 
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and 


(7) =a deem (9) 


m m! 


This definition of the binomial coefficient is valid for any real number 
rand any positive integer m (see p. 50 of Ref. 1). For m = 0, one defines 
(©) = 1, and for negative integers m, one defines (m) = 0. The symbol 
(m) is not used if m is not an integer. Denoting v = B/A, where p is any 
nonnegative real number, the solutions derived in this paper are 

(t) CaseAX Ap 


min {79,7} ee 
C'(uBY"(B)"—Y, i ce a ee ‘} 


i=0 n-1 
P,(t) = - 
() -(A-—1),if mt+v>0, (10) 
Sno; if No +v=0. (11) 


(ii) Case =p 


1 v Nt n+no min{ng,n} No 

1+At) \1+At & 
P,(t)=) [motn+v—i-1\f/ 1 _ 
n-i Ne 


no, if Nntv= 0. (13) 


1) i mtv>0, (12) 


Equations (10) and (11) with no immigration (v = 0) are identical to 
the results quoted by Bailey [Eqs. (8.47) of Ref. 10]. 


2.4 Application to capacity expansion 


In Section V, margin is defined as the capacity which must be built 
in excess of the mean to meet certain service requirements, and the 
percent margin is defined as the ratio of the margin to the mean in 
percent. The following is a summary of the main results: 

(1) By aggregating demands, less percent margin is needed than in 
the nonaggregated case. This effect is especially significant for small 
demands. 

(ii) Given a minimum desired time, T, between successive expan- 
sions, a procedure is established for determining the minimum capacity 
increment which would meet the given service requirements. 

(tit) Given a lead time, Tr, between the moment facilities are ordered 
and the time they are available for use, a procedure is established for 
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determining the threshold value of the remaining spare corresponding 
to the time at which new facilities should be ordered. 

(tv) By introducing immigration, the absorbing zero state is elimi- 
nated and the percent margin needed to meet the service requirements 
is reduced for moderately large to large times (of the order of two years 
or more for the particular values examined). 


Il. DERIVATION OF THE STATE PROBABILITIES 


The approach followed to solve the set of equations in (1) is the 
generating function technique.” In Ref. 10, a differential equation for 
the generating function, F'(s, t), defined below, is established and its 
solution is derived. The results are quoted in Section 3.1. Three well- 
known identities are given in Section 3.2 and are then used in Section 
3.3 to derive explicit expressions for the state probabilities. The pro- 
cedure followed in Section 3.3 is to identify F'(s, t) as the generating 
function for a convolution of two functions. 


3.1 The generating function 


The generating function, F'(s, ¢), is related to the state probabilities 
through the following expression: 


oo 


F(s,t)= ¥ s"P,(t), O<s<l. (14) 


n=0 


The differential equation for F'(s, t), given in eq. (8.63) of Ref. 10 with 
e'=s, is 


OO + Hs) A = Bis — Fs, 0), - 
ot Os 


where 
HA(s) = —(s — 1)(As — p). 


The solutions to this equation are 


A \’(/b+as\"° Ss a6 
d+es) \d+es/] ’ Hs 


(aa) ( 


* An alternative approach for obtaining this result is to substitute A — A for » in the 
\ ¥ p expression and to take the limit A > 0. 








F(s, t) = 


ar 
Qi 
iva) 


+ 








», A=upn," (17) 


Qa 
+ 
ty’ 
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where 


0<=s<=l, 

a=) — pe’, 

b = —p(1 — e*), 

c=X(1—e”), 

d=)e*'—un, (18) 
a=1-At, 

b =2t, 

c=—Ai, 

d=1+At. (19) 


The above solutions may be verified by direct substitution. Equation 
(16) agrees with eq. (8.71) of Ref. 10 and eq. (17) with v = 0 agrees with 
eq. (8.52) of Ref. 10. 


3.2 Useful identities 


In Section 3.3, use will be made of the three following well-known 
identities. 


3.2.1 Binomial identity 


For any a and f and for any nonnegative integer n, the following 
identity holds (p. 51, Ref. 1): 


(a+ B)" = ) & annem (20) 


m=0 m 


3.2.2 Negative binomial identity 


For any a and £ such that | 8/a|< 1 and for any real number r, the 
following identity holds (see pp. 51 and 269 of Ref. 1): 


(a— py" = 3 cur( {r )ararime 


If r is strictly positive, identity (12.4) on p. 63 of Ref. 1 may be used to 
write 


epee ( er pram (21) 


m=0 m 


3.2.3 Generating function for a convolution 


Let F'\(s) and F2(s) be the generating functions for the sequences 
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{P?}n=-01,-°+ and {P?},-0,1,-++, respectively, 


oo 


Fi(s) = ¥ s"P®, =Fr(s) = Yt s"P2. (22) 
n=0 


n=0 


The function F'(s) = F\(s)F2(s) is then the generating function for 
{Pn}n=0,1,-°+, the convolution of P{”? and P, and may be written as 


F(s) = ¥ SP. (23) 
n=0 
where 
P, = ¥ PPP, = 3 PPP, 
i=0 i=0 


The proof of this theorem is elementary (e.g., see Chapter 11 of 
Ref. 1). 

Note: This theorem applies to arbitrary sequences {P{”} and {P??} 
(not necessarily probability distributions) as long as their respective 
generating functions exist. Thus, the series in eqs. (22) must converge. 
For the purpose of this theorem, however, it is assumed that F(s) and 
F2(s) do exist. 


3.3 Derivation of explicit expressions 
3.3.1 CaseAX Ap 


The generating function in eq. (16) may be rewritten as follows: 
| F(s, t) = F,(s, t) F2(s, t), (24) 
where 
F,(s, t) = A’(d + cs)”, 


F2(s, t) = (6 + as)”. 


(a) not v>0 


Applying identity (20), it may be seen that F2(s, t) is the generating 
function for a binomial type function, 


F2(s, t) s (7 Jorm(asy 


m=0 m 


y s™P?(t), (25) 
m=0 
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where 


(™ Jonna if m=m, 
Pr (t) = 


0 if m>no. 


(26) 


In a similar manner, it may be seen from identity (21) that Fi(s, ¢) is 
the generating function for a negative binomial type function, 


F,(s, t)= A” y ( No tv+m— t)(-esynarintnn 


eee m 
= s™PY (t), (27) 

where 
P(t) = 7 +p = m— t(eeymarintntn, (28) 


It may be shown that | cs/d| < 1 for all values of t= 0,0 <s <1, and 
A, # = 0. Thus, identity (21) applies in all the relevant cases. 

It now follows from eq. (23) that F'(s, t) is the generating function of 
the convolution, 


P,(t) = ¥. P2(t)P2,(t) 
i=0 


7 min . No pro-igi 
i=0 L 


. ja-(” +yvtn-1i- Neceymtaroerren : (29) 


n-t 


where the upper limit on the sum arises from the condition i < no for 
P(t) established by eq. (26). 
Rearranging, one obtains 


A ¥ min {9,7} 
Pr(t) = (5) >, 
i=0 


(ers) Ca » 


From the definitions of a, b, c, and d in eqs. (18) and from eqs. (8), the 
above expression for P,,(t) reduces immediately to eq. (10). 


(b) not v=0 
For this case, F(s, tf) = 1. From the definition in eq. (14), it is then 
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apparent that all P,,(t) for n # 0 must be zero and that Po(t) = 1, 
which is the result shown in eq. (11). 


3.3.2 CasevX =p 


The solutions may be obtained by using the same procedure followed 
in Section 3.3.1. The only difference is that the starting equation 
should be eq. (17) rather than (16). Since eq. (17) can be simply 
obtained from eq. (16) by letting A > 1, a—> a, bb, c— é, andd 
— d, it follows that the final results for the case A = pcan be obtained 
from the results of the case  # p [1.e., eq. (30)] by making the above 
substitutions. Alternatively, eqs. (12) and (13) may be obtained by 
taking the limits of eqs. (10) and (11) as » — X. The procedure is the 
following: Consider A to be a constant, then replace p by A — A wherever 
it appears, and finally take the limits A — 0 using l’Hospital’s rule 
whenever necessary. The results of this limiting procedure are found 
to be 


ii 2 


MOS Tang ey 
inco=<—— 
dX 1+At ; 


IV. PROPERTIES 


In this section, the zero-state probability and the cumulative prob- 
ability distributions, for several choices of the parameters A, p, and £, 
are examined as a function of time. In addition, some cases in which 
the state probabilities are especially simple are indicated. 


4.1 Probability of ultimate extinction 


The birth-death process without immigration is characterized by an 
absorbing state at n = 0. If the system reaches that state at some time 
to, it will stay there for all t > % since the birth rate is zero. The 
probability of hitting that state at time ¢ is given by Po(t). In the limit 
t — oo, this probability tends to 


> A>up, 


“ons 
~ |= 
Ne 


lim Po(t) = (32) 
t— 00 
1, ASup. 
Thus, for any » # 0, there is a nonzerc probability of ultimate extinction 
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Fig. 1—Zero-state probability. 


of the process, while for 4 = A, ultimate extinction is a certainty. This 
feature may be removed, if desired, by introducing immigration. In 
this case, the limiting value of Po(t) tends to a nonzero value for \ < 
p. and to zero for A = p, 


(5) (¢) lime" =0, A> 4H, 


lim Po(t) = (33) 


AY 
1--|], ASU. 
(:-*) : 


The effect of immigration on Po(t) is shown graphically in Fig. 1, 
where Po(t) is plotted for various values of A, u, and », and for no = 1. 
The case no = 1 was chosen for clarity of the figure, since the effect of 
immigration is larger for smaller values of no. 


4.2 Cumulative probability distribution 


The cumulative probability distribution is defined as 
F,,(t) = BS P;(é). 
i=0 
In order that lim,_,.F,(¢) = 1 for all ¢, it is necessary and sufficient 
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that 7-0 An’ diverges, which is the case for the process discussed in 
this paper (see Theorem on p. 452 of Ref. 1). The function, F,,(¢), for 
various choices of A, #4, and £, and for no = 5 is shown in Figs. 2 through 
5. The lowest curve plotted in each figure is Po(t), and is consequently 
a measure of the extinction probability. 

The values of (0.6) and p (0.3) chosen in Figs. 2 and 3 correspond 
to net positive growth (see discussion of growth in Section 5.1). As 
may be verified from eqs. (32) and (33), the lim;_,..Po(t) is nonzero in 
Fig. 2 and zero in Fig. 3. In addition, for each n > 0, the lim:.,..P,(¢) 
= 0, although the lim... )iz-1 Pn(t) is nonzero. Thus, for any n > 0, 
the lim,_,..F,(t) is nonzero for 8 = 0 and zero for £8 # 0, reflecting the 
fact that the extinction probability is nonzero in the first case and zero 
in the second. (The n = 0 curve in Fig. 3 is essentially flat and is hard 
to distinguish on the graph.) 

The case in which the death rate is larger than the birth rate is 
shown in Figs. 4 and 5. This situation corresponds to negative growth. 
As may be verified from eqs. (32) and (33), the lim;,..Po(t) is 1.0 in 
Fig. 4 and 0.5 in Fig. 5. It may be shown that, for the B = 0 case, 
lim,..P,(t) = 0 for all n > 0, while for the 8 ¥ 0 case, lim;,.Pn(t) ¥ 
0 for all n = 0. In both cases, however, the lim,_,..F,(t) is nonzero, 
reflecting the fact that the extinction probability is nonzero. 

Finally, the case \ = p, B = 0, is similar to Fig. 4 (with extinction 
probability equal to unity), and the case A = p, B ¥ 0, is similar to Fig. 
3 (with extinction probability zero). These cases are not shown. 


Fr(t) 





TIME IN YEARS 


Fig. 2—Cumulative distribution without immigration (A > p). 
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Fig. 3—Cumulative distribution with immigration (A > p). 


4.3 Special case solutions 


For certain initial conditions, the general solutions reduce to simple 
analytical forms. For no = 0, the interesting process is the one including 
immigration (vy = 1). The state probabilities of eq. (10) may then be 


1.20 






Fa(t) 0.60 
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Fig. 4—Cumulative distribution without immigration (A < p). 
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Fig. 5—Cumulative distribution with immigration (A < y). 
written 


P,(t) = craay'(” ae ‘). (34) 


For mo = 1, the process without immigration becomes interesting. 
Equation (10) with v = 0 reduces to the well-known form*’”” 


Po(t) = pB, 
Pr(t) = (uB)(AB)"A = (1-AB)(1—pB)(AB)""* s(n #0). (35) 


V. APPLICATIONS TO CAPACITY EXPANSION 


In this section, the A, », and 8 parameters of the model are related 
to more physically intuitive quantities such as growth and turnover 
(or churn). The concept of margin, the extra capacity needed to meet 
the demand within a given held-order probability, is introduced. The 
effects of randomness and immigration on the margin are then exam- 
ined, and finally, several capacity expansion problems are addressed. 


5.1 Growth, churn, and turnover 


The model described in the preceding sections is completely speci- 
fied once the parameters A, pw, and £ are known. In this section, 
quantities that are more physically intuitive than the birth and death 
rates, namely growth and turnover (or churn), are introduced and 
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related to A, p, and £. First define the following quantities: 


b(t) = EB(t) = mean number of births in [0, ¢], (36) 
d(t) = ED(t) = mean number of deaths in [0, ¢]. (37) 
Then, 
m(t) — no = b(t) — d(t) 
= mean net population increase in [0, ¢], (38) 


where E refers to the expected value and where m/(t) is the mean value 
of the population, as defined in eqs. (4) and (6). Differential equations 
for b(t) and d(t) are derived in Appendix A and exact analytical 
solutions for these equations are found. 

The annual rate of growth, g, is defined as the change in the mean 
number of circuits in one year divided by its value at the beginning of 
the year. Thus, 

m(t + 1) — m(t) 


g(t) =" “gy (39) 


From eqs. (4) and (6) it may be seen that 


[7 + B/(A — w)Je®(e** — 1) 
SETS W 7 GRIN CBE (e375 YYW RG for 2X 
[no + B/A— W]e" = B/A—n) ee 


g(t) = (40) 





for A=u. 





no + Bt 


By observation, it may be noted that if B = 0, the growth is time 
independent, whereas if 8 > 0, the growth depends on time. A time 
average value of the growth may be defined to be 


1 T 
g = lim z| g(t) dt. (41) 
Tox T 0 
It may be easily found that 
0, for A<p, BHO, 
&= +0, for A=4H, (42) 
e*#*—1 for all other cases. 


When JA = yp, it must be borne in mind that g = 0 does not necessarily 
mean no growth. In fact, for the B > 0 case there is linear growth at 
the rate B [recall eq. (6)]. Thus, it is the immigration factor that 
represents the growth in this case. 

In summary, throughout this paper, the last expression in (42) will 
be used as the definition of growth with the provision that it is the 
variable B that actually describes the growth in the A = u case. The 
case A <p and £ > 0 will be disregarded. 
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Churn may be defined in various ways. Its purpose is to quantify the 
“activity” of the process, i.e., to compare the number of “connects” 
with the number of “disconnects.” For instance, churn may be defined 
as the ratio of the mean number of fotal connects in one year to the 
mean number of net connects in the same period. This ratio has also 
been called in-to-net, or in-to-gain ratio.'® Thus, 


w(t) = St = 8) _ aN 


~— m(t+1)—m(t) NET’ 7 


This quantity has the easy interpretation of being the expected number 
of connects in one year for a net increase of one circuit. For instance, 
a churn of four (which seems to be a typical number'®) means that 
four connects are expected for every net increase of one circuit. Of 
course, it follows that three disconnects are also expected for consist- 
ency. The problem with this definition of churn is that for low-growth 
cases (i.e., when the net increase is almost zero) the ratio of total 
connects to net may become a very large number. Furthermore, in the 
case of negative growth, this ratio becomes negative. Thus, the range 
of values which the churn may take is very large, which makes it a 
difficult number to work with in data analysis. 

For this reason, an alternative definition of churn is introduced and 
is called turnover. Turnover is the expected number of connects 
(disconnects) needed to replace the number of disconnects (connects) 
that occurred in one year, divided by the expected number of circuits 
in place at the beginning of the year. The words outside the parentheses 
refer to the positive-growth case in which there are more expected 
connects than disconnects, and the words in the parentheses refer to 
the negative-growth case when the reverse is true. The turnover may 
be written as 


1 IN + OUT — |IN— OUT 


a(t) = (44) 
MEAN 
_ min(IN, OUT) (45) 
MEAN 
where 
IN = b(t + 1) — (2), 
ouT = d(t + 1) — d(o), (46) 


MEAN = m(b). 


Thus, a turnover of 0.3 with positive growth indicates that over the 
next year the expected number of disconnects will be equal to 30 
percent of the mean at the beginning of the year. The expected number 
of connects depends on the growth and will be greater than or equal to 
the disconnects. (Note that it is not necessarily the same circuits that 
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are connected and disconnected.) From the expressions in Appendix 
A, eq. (44) may be written as 


4+ (nee*! — B/A)[nene*(e* — 1) (A + p)/A 
— 2Bu/A — neve“’|(e* — 1)]] (A ¥ yp), 
a(t) = (47) 
1 QABt + AB + 2Wno _ 
- OO ;yW (A = LB), 
2 No + Bt 


where 


Nett = No + ——. 
al a war , 
Again, by observation, it may be noted that if 8 = 0, the turnover is 
time independent, whereas if 8 > 0, it is time dependent. As in the 


case of the growth, a time-average value of the turnover may be 
defined to be 


1 (7 
a= lim z| a(t) dt. (48) 
T— 0 T 0 


It may be easily found that 


Fle 1) if A\>pn, 
a= Fie if A<p, B=0, (49) 
U for all other cases. 


Throughout this paper, the above expressions for the turnover will be 
used. As mentioned before, the case A < p and £ > 0 will be disregarded. 

From preliminary data analysis, it has been found that typical values 
of growth and turnover for special services fall into the range —0.15 to 
+0.15 for the growth and 0 to 1 for the turnover. Nominal values of g 
= 0.1 and a@ = 0.3 were chosen in this paper. 

To do a study using stochastic special services demand, numerical 
values of the parameters of the modei are needed. Accurate values, if 
such values exist, may only be found by careful data analysis of 
historical demands. Approximate values, however, may be found as 
follows. First equate the mean [eqs. (4) or (6)] to the special services 
forecast to obtain values for A and 8. Then equate the variance [eqs. 
(5) or (7)] to some measure of the forecast uncertainty to determine 
A and p. For some cases, given below, one may conveniently use eqs. 
(42) and (49) to write the mean as a function of growth alone, and the 
variance as a function of both growth and turnover. The results are as 
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follows. 
(t) Exponential growth with no immigration (A + p, B = 0) 


m(t) = no(1 + 8)’, (50) 


u(t) = no (2 + 1) (1+ g)[(1 + g)' — 1, 


where the positive (negative) sign refers to A > p (A <p), 
(ti) Linear growth (A = p, 8 = 0) 


m(t) = Bt + No, (51) 
v(t) = aBt? + (2ano + B)t. 
5.2 Margin and minimum capacity increments 


Define the quantity h(t) as the probability of a held order, i.e., the 
probability that at least one service order is delayed due to lack of 
spare facilities. Then 


A(t)= YO P(t) =1- Fale), 
n=d+1 
where d = d(t) is the total number of servers (facilities) at time t. The 
quantity A(t) is similar but not identical to the transient time conges- 
tion function (see Appendix B). Computationally, d(t) may be deter- 
mined from the state probabilities by requiring A(t) to be less than or 
equal to some predetermined number, h. Thus 


d(t) = min{d = 0,1, ---| y P,,(t) = h}. (52) 
n=d+1 


Of course, the actual sum involved in the computation is not infinite, 
since the condition in eq. (52) is equivalent to )\2-0 P,(t) = 1—h. The 
level d(t) can be viewed as the sum of the mean number of circuits 
and a quantity which may be called margin. Thus, given any time ¢ > 
0, the margin is the capacity which must be built at f& = 0 in excess of 
the mean m(¢), in order to meet the demand, within the maximum 
held-order probability, h. 

A significant quantity is the ratio of the margin to the mean in 
percent which will hereafter be referred to as the percent margin. 
Figure 6 shows a plot of this ratio as a function of time for various 
values of growth and no immigration. Turnover has been taken to be 
0.3, the initial number of circuits 5, and the maximum probability of a 
held order 0.05. As may be seen, the percent margin increases with 
time, and generally less percent margin is needed for larger growth 
rates. The sensitivity of the percent margin with respect to turnover, 
for a growth of 0.1, is shown in Fig. 7. It may be seen that the larger 
the turnover, i.e., the larger the “activity” in the network, the more 
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Fig. 6—Sensitivity of percent margin to growth (no immigration). 


margin one has to build to provide the same maximum held-order 
probability. 

The concept of margin has several applications, one of which is the 
determination of an appropriate capacity increment at each expansion. 
Given a minimum desired time, 7, between expansions, it would be 
useful to determine the minimum capacity increment, c, which, if 
installed at time ¢, will exhaust in [¢, ¢ + T'] with a probability that is 
no larger than h, or equivalently, the increment which will last the 
interval T with a probability greater than or equal to 1 — h. Thus, the 
condition on c may be written as 


Prob{. W(t + £) Sno t+ec|V(t)=n, VEEl[O,T]}=1—h. (53) 


Since the A and p coefficients are time independent, the process is time 
homogeneous. Consequently, changing the origin of time does not 
affect the problem. Choosing it to be at ¢ is equivalent to setting t = 0 
in the above expression, and determination of c reduces to finding 


n=0 


min| e= 0 1, --- 
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Tables may be set up permitting direct reading of the values of c 
corresponding to the growth, turnover, initial state parameters, and to 
the time interval T. Some typical results are plotted in Fig. 8. 

For completeness, it must be mentioned that in problems of the type 
described above, questions about service-order queue disciplines must 
be entertained. A careful consideration of this aspect of the problem is 
beyond the scope of the present analysis, and the queue discipline 
implicitly followed has been Blocked Customers Held (BCH). See 
Appendix B. 


5.3 Effect of randomness: Aggregation benefits 


An inspection of Fig. 6 shows that the percent margin needed is 
large, for the particular case examined. Since the initial state consid- 
ered is rather small (7 = 5), an interesting question is to find out 
whether the percent margin can be reduced by combining demand to 
form larger quantities, in the hope that the statistics of the aggregated 
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Fig. 7—Sensitivity of percent margin to turnover (no immigration). 
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process will be better behaved, i.e., less susceptible to fluctuations. Our 
studies have shown that benefits are indeed obtained by aggregation. 
Figure 9 illustrates the results. The time evolution of the percent 
margin is plotted as the initial state of the system is varied, for fixed 
values of growth, turnover, held-order probability, and for no immigra- 
tion. Two important observations may be made. The first one is the 
fact that the percent margin decreases as the initial state of the system 
increases. An implication of this behavior, for example, is the following: 
Suppose demand between two points is being satisfied by two inde- 
pendent routes (with initial number of circuits n§” and n§, respec- 
tively). Benefits would be obtained by combining the two demands on 
one route (with initial number of circuits n§ + n§?) because the 
margin one would have to build in this case is less than in the 
nonaggregated case, the held-order probability being the same. The 
second observation is that the change in the percent margin with 
respect to mo is larger for small values of no. The implication is that the 
benefits will be especially significant when aggregating small demands. 

It must be mentioned that the conclusions about aggregation 
benefits in the example given above were based on an examination of 
Fig. 9. The implicit assumption was that the combined process would 
obey the same birth and death equations as each single process, and 
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Fig. 8—Minimum capacity increments. 
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Fig. 9—Sensitivity of percent margin to initial demand (no immigration). 


that it would be described by the same pair of transition rates, A and 
p.. This assumption is only true if all the individual processes are of the 
same type, i.e., if they all have identical transition rates. It turns out, 
however, that the method for determining margin described in the 
previous sections may be extended with little additional effort to the 
general case in which there are M simultaneous processes characterized 
by a set of transition rates {X‘, '} for i= 1, ---, M. Since the processes 
are independent, the joint probability distribution is the product of 
the individual distributions, 


Pigg) =. PY (t)P? (t) baad PY) (e). (55) 


The probability of being in some level 71, regardless of the composition 
of that level, may then be written 


P(t) =’ WY ee SE POMPE (D «+» PLOW, 
ny Ng ny (56) 
NtnNgt-e>> +nyHn. 


The sums are over all values of m;, nz, --+, nw such that n; + no + 
-++ + ny =n. The primes over the summations are an indication of 
this restriction. The margin for the combined process may then be 
determined from the mean m(t), and the quantity d(t), analogous to 
that defined in Section 5.2. The mean for the combined process is 
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simply the sum of the individual means, 
m(t) = mi(¢) + m2(t) + --- + mm(t), (57) 


and d(t) may be obtained from an expression similar to eq. (52), 
namely, 


d(t) ~ min{d =o, 1, --- 





5 P(t) s h}. (58) 


n=d+1 


5.4 Effect of immigration 


Immigration affects the problem in several ways. First, it eliminates 
the absorbing state at nm = 0, and consequently the probability of 
extinction, for all cases except the A < pn, B ¥ 0 case. Furthermore, the 
lim;.. P(t) = 0 (n > 0), for all cases except the case mentioned above, 
for which the limit is nonzero. Finally, a nonzero value of 8 gives the 
model flexibility to represent linear growth (for A = p) as well as 
exponential growth (for \ ¥ yp). It is interesting to note that for the A 
> p case, immigration actually reduces the percent margin for moder- 
ately large to large times, as seen in Fig. 10. This behavior is due to the 
fact that introducing immigration increases the mean (which tends to 
decrease the percent margin) faster than it increases the variance 
(which tends to increase the percent margin). 
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Fig. 10—Effect of immigration on percent margin. 
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5.5 Other applications: Spare threshold 


In capacity expansion problems, a question that arises is when to 
expand. If facilities could be installed instantaneously, the expansion 
time would be simply the time at which remaining spare exhausted. 
However, there usually is a lead time, r, between the moment facilities 
are ordered and the time they are actually installed. If existing spare 
can be monitored, it would be useful to determine a priori what the 
particular level of spare would be at the time when new facilities 
should be ordered. This information would then yield the order time, 
since as soon as that spare level is attained, it is time to order. This 
value of spare, or spare threshold, o, is the amount of remaining 
capacity which will exhaust in [¢, ¢ + 7] with a probability that is no 
larger than h, or equivalently, the amount of capacity which will last 
the interval + with a probability greater than or equal to 1 — h. If the 
last expansion occurred at f, providing a total capacity of C(t) (see 
Fig. 11), the constraint on o may be written as 


Prob{ V(t + €) = C(t) | 
N(t) = C(t) —o, VEEl[O,T]}}=1—h. (59) 
Time homogeneity of the process allows setting ¢ = 0 in the above 


expression, as discussed in Section 5.2, and the determination of o 
reduces to finding 


min { o= 0 1, --- 





C(to) 
dy PAg)=1—h, VéeE[O, rh. (60) 


n=0 
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NOMINAL CURVE PARAMETERS 
GROWTH = 0.1 
TURNOVER = 0.3 
h = 0.05 
vei TURNOVER = 1.1 
CAPACITY = 24 CIRCUITS 


NOMINAL CURVE 


SPARE THRESHOLD 
(AS A PERCENTAGE OF THE CAPACITY) 


CAPACITY = 48 CIRCUITS 





LEAD TIME IN MONTHS 
Fig. 12—Spare threshold as a function of lead time. 


where o appears implicitly in the initial condition used to evaluate the 
state probabilities P,(£). Again, tables may be set up permitting the 
direct reading of the values of o corresponding to the growth, turnover, 
capacity levels, and the lead time, 7. Some typical results are plotted 
in Fig. 12, where o is shown as a percentage of the total capacity. 


VI. CONCLUSION 

A summary of the main results has been given in Section IT. Explicit 
solutions for the birth-death process in which the births and deaths 
are proportional to the state have been derived, and some of their 
applications to capacity expansion problems have been discussed. A 
method for determining margin has been described in detail for the 
case in which one birth-death process exists with exponential growth 
characterized by a pair of transition rates, \ and p, and it was shown 
how to extend the method to cases in which there were several 
simultaneous processes. 

In addition to its applications to capacity expansion problems, the 
proposed model is useful in assessing the potential of routing strategies 
for special services. It has already been shown, for example, that 
benefits are to be expected by aggregating small demands. These 
conclusions were based largely on service robustness considerations. 
To obtain more comprehensive results about routing strategies, it is 
clear that cost robustness considerations must be addressed as well. 
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APPENDIX A 
Expressions for b(t) and d(t) (Section 5.1) 


As indicated in Section 2.1, 
And + 0(6)* = Probability of a birth in [¢t, t + 6] given that 
the system was in state n at time #¢, 
Hind + 0(5) = Probability of a death in [¢, ¢ + 5] given that 
the system was in state n at time ¢. 


Letting -V (t) be the random variable representing the number of 
circuits at time ¢, the total probability of a birth in [¢, ¢ + 6] may be 
written as 

Prob {a birth in [¢, t + 6]} 


= )' Prob{a birth in [¢, ¢ + 6] and ./(t) =n} 


n=0 


= ¥) Prob{a birth in [¢, t + 6]| ”(t) = n}Prob{V(t) = n} 
= ¥ [And + 0(5)]Pr(t), (61) 


where the certain event and Bayes’ rule were successively used. Simi- 
larly, the total probability of a death in [¢, ¢ + 6] is ¥, [pnd + 0(8)] 
P,,(¢). With this information, a differential equation for b(t) may be set 
up as follows: 


b(t + 6) = b(t)[Probability no event or a death] 
+ [ b(¢) + 1][Probability of a birth] + o(6), 
where o(6) is the contribution of more than one birth. 


b(é + 8) = B(é)[ {1 — ¥ (And + pnd + 0(8))P,} 
+ ¥ (un5 + 0(5)) Pr] 


+ [ b(t) + 1] ¥ [A.6 + 0(8)]P,(¢) + 0(8). (62) 


o(8) 


* o0(-): R' > R' is such that lims.o 5 = 0. 
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In the limit 6 — 0, eq. (62) becomes 
d foo} 
Co = nin t . 63 
7 b(t) a P,,(¢) (63) 


In a similar manner, the following differential equations for d(t) and 
for m(t) may be obtained: 


d foo} 

a d(t) = x PnP, (t), (64) 
© m(t) = (Xn — Ha)Pall) (65) 
dt m = o, n— BnIL nC). 


Since 


o 


m(t) = > nP,(t), 
n=0 
one can use relations (3) to write eq. (65) as a differential equation for 
m(t). Its solutions have already been given in eqs. (4) and (6). With 
knowledge of the mean, eqs. (63) and (64) may be solved for b(t) and 
d(t), respectively. The results are easily found to be 


But 


A 
b(é) 7 pio 7 a . (66) 
~ | Ape? 
“ET + (Xn Blt (A= 0), 
t 
F ner(e™ — 1) — x (AA), 
d(t)=) \,,2 (67) 
“er + Anot (A = p), 
where 
Nett = No + F 
and 
A=A-p 
APPENDIX B 


Queue Discipline and Held-Order Probability 


In the model described in this paper, the queue discipline followed 
is Blocked Customers Held (BcH). Let the random variable T denote 
the sojourn time of the customer, i.e., the total time he spends in the 
system, either waiting for service or being served. The assumption 
inherent in the BCH queue discipline is that the customer will spend 
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time T in the system, after which he will depart, regardless of whether 
he has been served or is still waiting for service. The choice of p, = 
n » implies that the sojourn times have a negative exponential distri- 
bution. 

In special services, if the sojourn time distribution is in fact negative 
exponential, then the queue discipline used here should be correct. If, 
on the other hand, it is the service-time distribution that is negative 
exponential, then the BCH queue discipline assumed here may still be 
approximately correct if the average waiting time of a customer is 
much smaller than the average service time. 

To compute the held-order probability, care must be given as to 
whether the held order is seen by an outside observer or by an arriving 
customer. For processes with Poisson input, it is well known that the 
distribution P,(t) seen by an outside observer is identical to the 
distribution 7,,(¢) seen by an arriving customer (see Section 3.2 of Ref. 
3), and hence the distinction is unimportant. For the process described 
in this paper, however, the distributions are different. Define 


P,,(t) = Prob{. V(t) = n}, (68) 
tn(t) = Prob{./(t) = n|a customer arrives at t*}. (69) 
Expression (69) is the probability that a customer who arrives at ¢ 
finds n other customers being served or waiting to be served. Letting 
the event A refer to the arrival of a customer in the interval (¢, t¢ + 
5], and using conditional probabilities, one may write 7,(¢) as the 
following limit, if it exists: 
lim Prob{.W/(t) =n, A} 
tm (t) = —— 
lim Prob{A} 
5-0 
— lim Prob{A | /(t) = n}Prob{.W(t) = n} 


rs y Prob{A | 1(t) =7}Prob{W(t) =7} | 
j=0 


Since Prob{A |. /(¢) = n} = A,6 + o(4),* one obtains, with the help of 
(68), 





Pe ash t 

pea (70) 
> AVP (t) 
j=0 

The held-order probability may thus be defined as 

h(t)= > P,(t) as seen by an outside observer (71) 
n=d+1 
* o(-): R' > R’ is such that lims.o “ = 0. 
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or 


w(t) = ase NO b (72) 
‘(t) =————__ as Seen by an arriving customer, 
De MPO) e : 


where d is the number of servers. Expression (72) is the conditional 
probability that if a customer were to arrive at ¢, he would find all the 
servers engaged. This quantity is known in congestion theory as the 
transient call-congestion function.’ Expression (71) is the probability 
that at least one customer is waiting to be served. This quantity is 
similar but not identical to the transient time-congestion function, 
S(t), which is the probability that all servers are busy at time ¢, and 
which may be written as ° 


S(t) = ¥ P(t). . (73): 


n=d 
For Poisson input (A, = A), it may be shown that 
h’'(t) = S(t) > h(t). 
For the Kendall process, the relationship is 
h'(t) > S(t) > h(t). 


For the Poisson input case, since S(t) is equal to h’(t), the time- 
congestion function may be used as a meaningful measure of the held 
orders. For the Kendall process, on the other hand, S(t) does not 
describe the held orders as seen by either an arriving customer or an 
outside observer. Consequently, the time-congestion function is not 
believed to be a meaningful measure of the held orders. Throughout 
this paper, expression (71) was used for the held-order probability, 
although eq. (72) could have been used instead. 

Both A(t) and h’(t) are instantaneous quantities. Since the Kendall 
process with A > p is not ergodic (i.e., space averaging is different than 
time averaging), a space average must be made when measuring either 
of these quantities. Thus, one cannot measure A(¢) or h’(t) by exam- 
ining one sample for a long enough time; rather, one needs an ensemble 
of samples. Because of these measurement difficulties, an open ques- 
tion remains as to whether this type of held-order probability is the 
best measure of the provided service, or whether other quantities such 
as the time average of h(t) or h’(t), or the duration of the held order 
might be more meaningful. Nevertheless, it is clear that eqs. (71) and 
(72) are some measure of the provided service and, as such, are useful 
when comparing different special services provisioning methods meant 
to provide the same level of service. 
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High-Frequency Behavior of Waveguides with 
Finite Surface Impedances 


By C. DRAGONE 
(Manuscript received January 25, 1980) 


We derive the modes inside a cylindrical waveguide of finite surface 
impedances, assuming the waveguide transverse dimensions are 
large compared to the wavelength \. This paper restricts its consid- 
eration to the modes with B = k, where B is the propagation constant 
and k = 2r/X. For these modes we show that asymptotically, for large 
values of k, the field ~ becomes infinitesimal (of the same order of 
1/k) at the boundary. Taking this into account, we obtain simple 
expressions for the asymptotic properties of y for large k. The theory 
applies to a variety of waveguides: corrugated waveguides, optical 
fibers, waveguides with smooth walls of lossy metal, and so on. An 
important property of the modes considered here is that their atten- 
uation constant is very low, i.e., asymptotic to 1/k’ for large k. Thus, 
these modes are useful for long-distance communication. Another 
consequence of ~ — O at the boundary is that for large k the 
distribution of inside the boundary is essentially independent of the 
boundary parameters, i.e., independent of the surface impedances in 
the longitudinal and transverse directions. This consequence implies 
that the same radiation characteristics of the corrugated feed can be 
obtained using other structures and, therefore, construction can be 
simplified in many cases, with little sacrifice in performance. We also 
derive general expressions for y and the propagation constant B. 


1. INTRODUCTION 


It is known’ that in certain waveguides the field becomes, under 
certain conditions, very small at the boundary. Consider, for instance, 
a corrugated waveguide of radius a and let A be the free-space wave- 
length. This waveguide is characterized at the boundary by finite 
surface impedance Z, in the longitudinal direction. The frequency 
dependence of Z., which is determined by the depth of the corruga- 


89 


tions, causes the transverse field distribution (x, y) of a mode to vary 
with the frequency k = 27/\. However, this frequency dependence 
virtually disappears (for all modes except surface waves) if the wave- 
guide dimensions are large enough. In fact, one finds that y(x, y) 
approaches for ka — © a frequency independent distribution that 
vanishes at the boundary.” This behavior is responsible for the low 
attenuation constant, for the excellent radiation characteristics, and 
the wide bandwidth of corrugated waveguides.” We show here that the 
same behavior also occurs, under quite general conditions, in a variety 
of uncorrugated waveguides.'”’ Figures 1a and 1b show two examples, 
a dielectric waveguide’*'*"* of general cross section and a hollow 
waveguide with metal walls coated by a dielectric layer.*’’ Other 
examples can be obtained by modifying the boundary conditions in a 
variety of different ways. For instance, several dielectric layers may be 
used in Fig. 1b, or a metal grid of transverse wires may be placed at 
the boundary, as pointed out in Section II. Other examples are the 
waveguides of dielectric or lossy metal considered in Ref. 2. We now 
outline the main assumptions. 

Consider a cylindrical waveguide with an interior region of homo- 
geneous and isotopic material, as in Fig. 1c. Let Z and k be the wave 
impedance and propagation constant for a plane wave in the interior 
region, and let C denote the boundary. Then 


Z= vt k= wen. (1) 


Consider a mode with propagation constant 6 and let E, denote the 
transverse component of the electric field, 


E, = v(x, ye, (2) 


Let 2a be a characteristic dimension of the waveguide, for instance the 
width in the x direction as in Fig. 1c. We are concerned about the 
asymptotic behavior of ¥(x, y) for large values of ka. Consideration 
will be restricted to the modes for which the propagation constant 8 
approaches k, as ka > ». Thus, we assume 


Bok for ka—o, (3) 


This excludes surface waves, as pointed out in the following section. 
Then, a property of the modes considered here is that E and H become 
transverse, in the limit as ka > ~, 


lim E, = H, = 0. (4) 


kha-o 


Another property is that asymptotically, per large ka, a set of linear 
relations exist at the boundary among the tangential components of E 
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Fig. 1—Three examples of cylindrical waveguides: (a) a dielectric rod, (b) a waveguide 
with metal walls coated by a thin dielectric layer, and (c) a waveguide with boundary 
conditions shown in Fig. 2. 


and H. It is convenient to write these relations in the form 


oe | -i011_ 7x | on C, (5) 


z 


where [H] is a 2 X 2 matrix and H,, E, denote the components of H, 
E in the direction of the unit vector 7 in Fig. Ic. These relations 
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together with eq. (4) give at the boundary 
lim E,= H,=0 on C, (6) 


ka>o 
provided the matrix [H] does not diverge for ka > ~, 
[H]4#o for ka— o, (7) 


Throughout this article, we assume conditions (3), (5), and (7). 
Condition (5) is discussed in the following section, where it is pointed 
out that for most waveguides considered here, [H ] is a diagonal matrix. 
In this case it is convenient to define at the boundary surface imped- 
ances Z, and Z, by writing 


E, = Z,H:z, E, = —2.H,, (8) 


where Z, is the longitudinal impedance, and Z, is the transverse 
impedance. Then condition (7) requires 


Z,,1/Z, ## © for ka—>o., (9) 


It is important to realize that this requirement is violated in a number 
of cases. It is violated in a hollow waveguide with metal walls of perfect 
conductivity, since then Z, = 0. Furthermore, in a corrugated wave- 
. guide with corrugations of depth d, the longitudinal impedance Z, is 
determined by kd, and there are certain frequencies for which Z, = 0. 
A similar situation arises in Fig. 1b where both Z, and Z, vary with kd. 
Throughout this article it will be assumed that the quantities 


1 1 Z, 1Z 
ka kaZ’ kadZ, 
are small. Therefore, the results will not apply in the vicinity of the 
above frequencies. 


A direct consequence of condition (6) is that the boundary values of 
w vanish, in the limit as ka > ©, 


jim W(x, y) = 0 on C. (10) 


Another consequence is that ~ approaches a distribution y.. indepen- 
dent of ka, for ka —> o. For finite ka, 


y= do + dy, (11) 
where dy (but not y..) varies with ka and 
jim oy = 0. (12) 


Notice condition (10) implies that 
Yo = 0 on C. (13) 
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These results are of practical interest for several reasons. In the design 
of a feed,° it is desirable that the boundary values of be small [as 
implied by eq. (10)] since these values determine radiation in the side- 
lobes due to edge diffraction at the aperture. Furthermore, for broad- 
band applications, it is desirable that the frequency dependent part 
éw of the aperture illumination be small, as implied by eq. (12). Finally, 
in a corrugated waveguide, or a waveguide with metal walls coated by 
dielectric layers, power is lost only at the walls and, therefore, it is 
determined by the boundary values of wy. Since these boundary values 
vanish for ka > ©, the attenuation constant for the above waveguides 
for large ka is very small’”*'**'; it is asymptotic to ka~*. We shall see 
that in general y.. is independent of [H] and, therefore, a variety of 
different waveguides, with different [H ] but the same boundary shape, 
give rise to the same yj... This explains the similarity, noted in Ref. 9, 
between the modes of a corrugated waveguide and those of an optical 
fiber, a dielectric lined waveguide,‘ and a hollow dielectric waveguide.” 
This similarity implies that essentially the same radiation character- 
istics of corrugated waveguides can also be obtained with a variety of 
other waveguides. 

In the particular case of the optical fiber, some of our results are 
implied by the asymptotic expressions derived in Ref. 7. Exact solu- 
tions for the modes of the corrugated waveguide,” the optical fiber,’® 
and the hollow waveguide of dielectric’ are known for circular geom- 
etry. For a rectangular cross section, only approximate solutions*” are 
known, except in special cases.” Exact solutions for the slab waveguide 
are given in Refs. 8 and 24. In all these cases one finds that condition 
(3) implies condition (10). Measurements of the radiation characteris- 
tics of a dielectric horn are described in Refs. 25 and 26. Some of 
the properties derived here apply also to propagation in stratified 
media.”’*? The use of surface impedances to characterize a boundary 
is discussed in Ref. 30. 


Il. BOUNDARY CONDITIONS FOR ka — 


We now derive and discuss eq. (5). Figure 1c shows a waveguide 
directed along the z axis and of general cross section in which p is the 
outwardly directed normal and 7 is a unit vector tangential to the 
boundary, 


T=1, Xv. (14) 


The medium inside the boundary is assumed to be homogeneous and 
isotropic. Let C denote the closed contour of the boundary in the plane 
z= 0. 

We are concerned with the properties of ¥(x, y) in a waveguide of 
large transverse dimensions. Thus, consider a mode propagating in the 
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waveguide of Fig. 1b and suppose the width 2a is increased keeping 
the dielectric thickness d fixed. The resulting behavior of (x, y) as ka 
— o can be derived exactly in two cases, when (see Appendix C) 


—=0 (15) 


and when the boundary is a circle. In both cases one finds that for 
most. of the modes £ — k, as ka > «~. For these modes, the normalized 
field amplitude ¥(x, y)/Y(0, 0) becomes infinitesimal at the boundary 
for ka — o, For the remaining modes, those for which B does not 
approach k, just the opposite behavior takes place: The field becomes 
confined to the immediate vicinity of the walls, degenerating into a 
surface wave with propagation constant determined by the surface 
impedances of the walls. Here, consideration will be restricted to the 
modes satisfying condition (3). An important property of these modes 
is that asymptotically, for large ka, the surface impedances Z, and Z, 
become independent of ka. In fact, if one writes 


Z. 2 


d | Z’ Y IZ (16) 
then in Fig. 1b 
1 
xX > ———_ tan(Vn? — 1 kd), (17) 
Vn? —1 
2 1 
Y= (18) 


n 
Vn? —1 tan(Vn? — 1 kd)’ 


as shown in Appendix C. Thus, Z, and Z, depend only on the refractive 
index n and the thickness kd of the dielectric layer. 

For a circular boundary, the above relations can be derived rigor- 
ously by expressing the field in terms of Bessel functions, and then 
making use of well-known expressions giving the asymptotic behavior 
of these functions for large arguments as in Ref. 5. They can also be 
derived by the following argument, which applies in general to a 
boundary of arbitrary shape (Fig. 1c). Consider the field in the vicinity 
of a boundary point P in Fig. 1c. Since ka is large, the curved boundary 
can be approximated locally by the tangent plane. Furthermore, since 
B = k, the field is produced locally by a plane wave at grazing — 
incidence. If one determines the law of reflection for such a plane 
wave, and takes into account that the plane of incidence is parallel to 
the z axis, one obtains™”’ eqs. (16)-(18). 

The above argument can be used to derive the asymptotic behavior 
of Z, and Z, for a variety of other waveguides, illustrated in Fig. 2.* In 
case (d) the boundary is a smooth surface of lossy metal with surface 


* Notice ko denotes & in free space. 


94 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1981 


resistance R,. In (a) the metal surface is corrugated. In (b) the medium 
outside the boundary is dielectric with refractive index nz < n,. This 
corresponds to the optical fiber of Fig. 1a. In (c), m2 > m as in Ref. 2, 
and, therefore, both Z, and Z, are real, which implies power is lost 
through the boundary. Other boundaries of practical interest are 
obtained from Fig. 2 by placing a grid of thin wires tangent to 7 on the 
boundary. This will cause Z, = 0 in all cases. For all these waveguides 
one finds, for large ka, that [H] in eq. (5) is a diagonal matrix, 


[H]=- 
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Fig. 2—Asymptotic values of the surface impedances Z, and Z, for different boundary 


conditions. 
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and the coefficients X and Y are determined by the surface impedances 
given in Fig. 2. Notice there are cases where [H] is not a diagonal 
matrix, as pointed out in Section VIII. 

In the following sections we consider a waveguide with boundary 
conditions given by eq. (5) and derive simple expressions for the 
dependence of ¥(x, vy) upon the matrix [H]. These expressions are 
obtained neglecting terms of order higher than 1/ka and, therefore, 
they do not require an exact knowledge of [H], but only of the 
asymptotic behavior of [H], which can be determined as seen in this 
section. In Appendix A, a procedure for determining y to any desired 
accuracy is pointed out, but the procedure requires that [H] be known 
accurately. Any desired accuracy for [H] can be obtained by a proce- 
dure of successive approximations, but the resulting expressions are in 
general too complicated to be of practical interest. 

Concerning the validity of the following derivation, it is important 
to realize that even though the expressions obtained for y¥ will not 
satisfy the actual boundary conditions exactly, the errors will be small, 
of order two in 1/ka. These errors imply the conditions satisfied at the 
boundary by the expressions in question can be obtained, from the 
actual conditions, by small perturbations, of order two in 1/ka. 


ltl. ASYMPTOTIC PROPERTIES OF y 


We now determine the dependence of (x, y) upon the matrix [H]. 
To this purpose, it is convenient to assume that [H ] is independent of 
kha. The expressions obtained for (x, y) will depend on the coefficients 
of [H]. By substituting for these coefficients the expressions obtained 
in Section II, one obtains the dependence of { upon ka in general, for 
a waveguide with frequency dependent [H]. 

Thus, consider a waveguide characterized by a given matrix [H], 
and let 


o° = k* — p°. (20) 
Let ¥(x, y) and o” be expanded in power series of I/ka, 
2. - ih 
y = Woo(x, y) + 2 Ra ¥)s (21) 
ta oi (1+ y i) (22) 
imi (Ra)' 


where the distributions y.., yi, etc., are independent of ka; they are 
determined entirely by the shape of the boundary and the coefficients 
of [H]. 

Using eqs. (21) and (22) one can derive from eq. (5) for large ka (see 
Appendix A) a set of linear relations involving y and the normal 
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derivative d)/av, 





ay, 
vy 1 ov 2 
=; Alyy | +O[k7] on. (23) 
v, 7 — 
OV 


Expressing y, and y, in terms of the x- y-components of ¥, we obtain 


Ox 
Wx 1 Dp: 
= lA] ’ | +O[k] on, (24) 
" ~ 
-OV 
where 
Vx, pv. Vx, Vy 
cay= |" Z| [H] is ‘i | (25) 


vz and v, being the direction cosines of p. 


3.1 Derivation of ).. and c,; 


We now show that for each nondegenerate eigenvalue o., there are 
in general two modes, characterized by different values of c;. For most 
waveguides, [ H] has the diagonal form (19), and in this case we shall 
see that y.. is linearly polarized. Furthermore, if the boundary has an 
axis of symmetry, then the polarization vector i of y.. is either parallel 
or orthogonal to the symmetry axis. Very simple expressions are 
obtained in this case for Y, c1, ¥1. More difficult is the treatment for 
degeneracy of order N > 1. Then, in order to determine y.., one must 
find the 2N latent roots of a certain characteristic equation. 

The function must satisfy the wave equation, 


Viw + op = 0, (26) 


V. being the transverse part of V. Equation (24) implies that the 
boundary values of y vanish in the limit as k > o.,”* It is, therefore, 
convenient to represent y in terms of the eigenfunctions f,, fo, etc., 
that satisfy the boundary condition 


f-=0  onC. (27) 
Let u; be the eigenvalue of f,, 
| Vif- turf, = 0. (28) 
From equations (21) and (22) for k — ©, one has 
Y> Yo, = > ns, (29) 


* From now on the waveguide dimensions will be kept fixed, as & is increased. 


HIGH-FREQUENCY BEHAVIOR OF WAVEGUIDES 97 


and, therefore, y.. must satisfy the wave equation with o replaced by 
o.. Furthermore, from eq. (24), 


Yo = 0 on C. (30) 
Therefore, o.. must be one of the eigenvalues u,. Let 
On = U1 (31) 


and suppose there is degeneracy of order N, so that N distinct eigen- 
functions f\, ---, fy correspond to the same eigenvalue. Then 


uy = e+e = UN (32) 
and y.. can be written in the form 
N 


N 
Yo = > xm fm(X, y) i, + a» Oym fm (X, y) 1y; (33) 
m=1 m= 


involving N eigenfunctions and 2N coefficients axm, Qym. We now show 
that these coefficients cannot be chosen arbitrarily, but there are in 
general only 2.N possible choices corresponding to 2N distinct modes. 
3.2 Values Of Qxm, Qym 


The values of at any point (x’, y’) inside the boundary are related 
to the boundary values through the integral relation’ * 


V(x’, y’) = i V(x, Sat (34) 
Cc V 


where G = G(x, y; x’, y’) is Green’s function satisfying the equation 
ViGt+o°G = 8(x — x')8(y — y’) (35) 

and the boundary condition G = 0 when x, y is a point of C. Let G be 

represented in terms of the eigenfunctions [,, 

1 


2 
o — ue 





G= » f(x, ¥) f(x’, y’), (36) 


where it is assumed that f, are properly normalized so that they are 
real functions satisfying 


[| f2(x, y) dxdy = 1 (r= 1, 2, ---), (37) 
S ; 


S being the region inside the boundary C. From eq. (36), G contains a 
component 


Gu 





N 
2 f(x, ¥) fAx’, ¥’), (38) 


2 
Oo” — 0% 
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which diverges for k — ~, since o > o.. For large k, 
saa Bes b> 
G. = ka —y — ¥) fla, y) f(x’, y’). (39) 
On Ci ] 
The asymptotic behavior of eq. (34) for large k is now examined. 


Approximating G by G.., fromm eqs. (24), (25), and (34) one obtains the 
integral relation 





fA de | 3G 
He -; {| ae dl, (40) 
where 
ve = a (41) 


Substituting eq. (39) in eq. (40), and taking into account eq. (33), one 
obtains for the coefficients axm, @ym the characteristic equation 


tl Phe) [el || as) | 
le ie Al aa ~ 
where 
c= a=‘ (43) 
and 
et hi i Oh 
noha oy | Aan say (44) 


and similarly for I,,, Iyx, Ly (replace Axx with Axy, Ayx, Ayy). 
Equation (42) admits, in general , a total of 2N independent solu- 
tions a}, a2], +++, aan] for 


a] = mel (45) 


7 ay | 


Each solution is obtained by solving eq. (42) with c; set equal to one 
of the 2N latent roots A, Ae, ---, Aon of the matrix 


=|] tei “6 


If the boundary is lossless, then one can verify that [J] is Hermitian 
and its latent roots are real. In this case the 2N solutions are orthog- 
onal, 


a; jas |*é = 0 for is, (47) 


where ( )* denotes the transpose conjugate. 
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Thus, to determine the coefficients ax, a), etc., which specify y., 
the 2N latent roots of the matrix [7] must be determined. If the roots 
are all distinct, then they correspond to 2/N distinct modes character- 
ized by different c., ie., by different propagation constant £. If there 
is degeneracy (N > 1), the expressions obtained from eq. (33) for y.. 
are quite complicated. Much simpler is the treatment for N = 1, since 
then only one eigenfunction f(x, y) is involved. This is the most 
important case in practice, since it applies to the fundamental modes, 
which correspond to the lowest o.. (see Appendix B). 


3.3 CaseN= 171 


For N = 1, only one eigenfunction fi(x, y) corresponds to o.. and eq. 
(33) reduces to 


Yoo ia filx, y)i, (48) 


where i is a unit vector that determines the polarization of yw... If A1 ¥ 
A2, then eq. (42) for N = 1 specifies two polarizations, corresponding to 
two modes with different propagation constants. To determine these 
two polarizations, let a, and a, be the direction cosines of i. Then from 
eq. (42) with N = 1, 


Tee — C1 Ley Ax} _ 
ie i= | | =e ay 
where 
2 
re] 
des = = | Axx 2] dl, (50) 
Tx Jo ov 


and similarly for J,,, I., Iyy (replace Axx with A,,, etc.). From eq. (49) 
one obtains for c; the characteristic equation 


(Tax = C1) (Ly C1) aa Leylyz; (51) 


whose solutions A; and Az determine for i two eigenvectors i; and iz 
with direction cosines specified by eq. (49). Notice ii = iz if Ai ¥ Aco. If 
the boundary is lossless and Ai ¥ Az, then from eq. (47) 


i, -if = 0, (52) 


and, therefore, the two eigenvectors represent orthogonal polarizations. 
For all the waveguides of Fig. 2, [H] is given by eq. (19). Then 


a of, ° 
1 
iy = pes = es I (Y = X)ran] dl, (53) 
a af ' 
Ox Jo ov 
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and similarly for J,, (interchange x = y). Since in this case [J] is a 
symmetric matrix, 


i, - in = 0, (55) 


and if i; is real (y. is linearly polarized), then i, is also real and 
orthogonal to ij. 

We conclude by deriving a general condition that must be satisfied 
so that y.. is linearly polarized. Suppose i is real. Then the x axis can 
be oriented so that i = i,. This implies a, = 0 and therefore J,, = 0. We 
conclude that linear polarization is obtained only if it is possible to 
orient the x axis so that 


2 
| (Y- Xyran] dl = 0. (56) 
é ov 


Notice then I,, = I,, = 0, and, therefore, 
i, = I ip = iy. (57) 


The above requirement is always satisfied if the boundary is lossless 
(then X and Y are real) or if the values of X and Y are independent of 
position 7 on the boundary. It is also satisfied if the boundary has an 
axis of symmetry. In fact, let the symmetry axis be the x axis. Then X, 
Y and df, /dv in eqs. (53) and (54) are even functions of y, 


& (x, y) aes X (x, —y), Y (x, y) = Y (x, =); (58) 


whereas p,v, is an odd function of y, which gives condition (56). 

We thus conclude that in most cases of practical interest y,. is 
linearly polarized. This is of importance in the design of a feed for 
reflector antennas, to obtain good cross-polarization discrimination 
over a wide frequency range. 


IV. PROPAGATION CONSTANT FOR N = 1 


Assume the boundary has an axis of symmetry given by the x axis 
and let N = 1. Let [H] be given by eq. (19), which applies to all 
waveguides of Fig. 2. Then, for the mode polarized in the x direction, 
the coefficient c,; coincides with J,, and it is given by eq. (54). Notice 
eq. (54) assumes that fi(x, y) is normalized as shown by eq. (37). If fi 
is not normalized, we must divide the right-hand side of eq. (54) by the 
left-hand side of eq. (37) with r = 1, then obtaining 


(Z, v3/Z + Zv2/Z.) (afi /av)* dl 
Oe (59) 


uj {J fi dxdy 
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for i = i,. For the other polarization i = i,, interchange v, = pv, in the 
above expression, which shows that the two polarizations are in general 
characterized by different propagation constants. 

Once c, is known, the propagation constant 8 can be derived using 
eqs. (20) and (22) which for o. = uw give 


Pape. peed nates (60) 
a 


where the dots indicate terms of order higher than two in 1/k. If the 
medium inside the waveguide is lossless, & is real and the attenuation 
constant 7 is determined by the imaginary part of c:. Then eqs. (59) 
and (60) give 


(rv3 + gv?) (df: /av)? dl 
7 = —Im(f) = 


2k? {| fi dxdy 


where r and g are the real parts of Z,/Z and Z/Z.. This relation was 
used in Ref. 1 to determine the attenuation constant for a variety of 
waveguides of practical interest. 

Using the above expressions one can straightforwardly calculate the 
dispersion and attenuation characteristics of any mode for large ka. In 
the special case of a hollow waveguide of dielectric with circular 
boundary, eqs. (59) and (60) give eq. (31) of Ref. 2. 

Of greatest importance are the fundamental modes, which corre- 
spond to the lowest o... Then, for the circular boundary of Fig. 3, ~ 


filx, y) = Ado(cxp), (62) 


where p = Vx" + y’, Jo is the Bessel function of order zero, and o..a is 
the first root of Jo, 


(61) 


0.a = 2.4048. (63) 
For the rectangular boundary of Fig. 3, 


fi(x, y) = A cos (5 *) cos(5 2), (64) 


If fi(x, y) is normalized [see eq. (37)], then 


oe a (circle) 
a sl op Jy ( Ou a) ‘ 
A= (65) 
ae (rectangle) 
Vab 
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be — a— —>| 
y 
1b 
x Ag 
Fig. 3—Circular and rectangular boundaries. 


V. THE DISTRIBUTION ¥, 


Once y.. is known, the boundary values of ~ can be calculated with 
error of order two in 1/2 using eq. (23). If N = 1,i =i, and [H] is given 
by eq. (19), then one obtains at the boundary 


y= -; | x + Yv2)ir — (X- Y)rn%5 2 on. (66) 


To determine vy inside the boundary we separate y into two parts, a 
component 


filx, yi, 


plus a component due to the other eigenfunctions fs, fs, etc. The latter 
component can be determined with error of order two in 1/k by 
substituting eq. (66) in the integral of eq. (34), with G replaced by G 
— G. for o = o., as shown in Appendix A. Concerning the former 
component, it is shown in Appendix A that if the boundary has an axis 
of symmetry, then i is independent of k, and, therefore, one can set 
i = i, for all values of &. If there is no symmetry, then in general iis a 
function of & and, to determine its dependence upon k, one must follow 
the same procedure used in this section to determine i for k > ». 


Vi. APPLICATIONS 


We now derive the fundamental modes of circular and rectangular 
boundaries with diagonal matrix [H] given by eq. (19), which applies 
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to all the waveguides of Fig. 2. The surface parameters X and Y will be 
assumed to be independent of position on the boundary. Then for 


i = i,, 
Yoo a fix, y)ix, (67) 


giving y in the limit as k — o. We now wish to obtain a better 
approximation for y. 

For the rectangular boundary of Fig. 3b in eq. (66), one has p, vr, = 0, 
and, therefore, y has the same polarization as yj, 1.e., 


Y= pir, 
neglecting terms of order two in 1/k. The boundary condition (23) can 


then be satisfied separating ¥ into a product of two functions, y(x) 
and y2(y), subject to the conditions 


Y ah 
= =+ 
Wy rans for x= ca, (68) 
Xa 
= cn y= +b, (69) 
k ov 


whose solutions are well known.”**! For the fundamental mode one 
obtains 


y = A cos ax cos yy, (70) 


where 


7 7 >< 
c= Sf1- 2], veal Fae], (71) 


These results are closely related to expressions derived in Ref. 3 for a 
dielectric waveguide. Notice that for b > © the rectangular waveguide 
degenerates into two parallel plates placed at x = +a, in which case 
yy — v(x) and the modes can be derived exactly as in Appendix C. 
Similarly, for a > © one obtains two plates at y = +b and Y > yr(y). 
If both a and 0 are finite, then eq. (70) shows that y is simply the 
product of the two distributions (x) and y¥.2(y), provided terms of 
order two in 1/k can be neglected. 

Next consider the circular waveguide of Fig. 3. In this case it is 
convenient to introduce polar coordinates p, ¢. Taking into account 
that X and Y are independent of ¢, one obtains for the fundamental 
mode 


y= 


Al Jolop)i gee Ame: 


4 ka 








J2(op)(cos2oi, + sin2¢i,) + --- , (72) 
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where the dots indicate terms of order two in 1/k, and from eqs. (22) 
and (59), 


(73) 





0a = 0na nnd ; + 
i 2 ka : 


0.a being given by eq. (63). These expressions were derived previously 
for X = 0 in Ref. 5 and therefore details of their derivation are not 
given here. 

The above results for rectangular and circular waveguide are valid 
provided & is large enough so that the field at the boundary is small. 
From eq. (66) this requires 

1 xX 


—_<l — 
, ka 


Y 
<1l—-<lL 
ka 


ka 


If the waveguide dimensions are large enough, the results apply even 
to an ordinary waveguide with metal walls of finite conductivity. To 
determine how large the dimensions must be, consider for instance 
copper at 100 GHz. Then R,/Z = 2.189 x 10~*, and from Fig. 2d one 
obtains | Y| = 514 and |X| =5 x 107°. Therefore, the above inequalities 
require 


2a > 1000A, 


which is too large a diameter for all practical purposes. 

The above requirement is a consequence of the large value of Y for 
copper walls. By coating the walls with a thin dielectric layer, or by 
corrugating them, much lower values of Y can be obtained. Suppose 
for instance in Fig. 2e one chooses T = 1, R; = 0, m = 1, and nj = 2. 
Then, instead of the above requirement, one obtains a >>0.318A, which 
is a much more realistic condition. The attenuation constant of such 
a waveguide, or of other waveguides realized using one of the structures 
of Fig. 2, can be determined straightforwardly using eq. (61).’ 

When a waveguide is used to illuminate a feed aperture, then at that 
aperture usually ka > 1. Then the aperture illumination is given 
accurately by eqs. (70) or (72) for rectangular and circular apertures. 
For other apertures, it can be determined as pointed out in Section V. 
By deriving the Fourier transform of eqs. (70) and (72) the far-field 
can be determined and thus its dependence on the aperture parameters 
X and Y can be obtained. These applications are discussed in Ref. 1. 


Helical waveguide— A case where [H] is not a diagonal matrix 


Consider one of the waveguides of Fig. 2, and let a helical wire be 
placed at the boundary, as in Fig. 4. Then one finds that [H] is not a 
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7 BOUNDARY C 
/ 


Jonna tna 


Fig. 4—Helix with pitch angle 6. 


diagonal matrix, 
—1 
Le) XY ane 
Y, J sin@ cosé(1 + XY) (74) 
. —j sin@ cos#(1 + XY), xX t 


where X, Y denote the coefficients of [H] for 0 — 0, @ being the pitch 
angle between the wire and 7. 

Consider a circular boundary. Then, we have seen that for 6 = 0 the 
fundamental modes have the same propagation constant, and are 
linearly polarized for k — ©. In this section we show that for 6 # 0 
they become circularly polarized, and have different propagation con- 
stants B; and £2. This implies the following. Suppose at the input of 
such a waveguide (Fig. 4) the fundamental modes are combined so as 
to obtain, to a good approximation, linearly polarized excitation. Then, 
one will not obtain, in general, linear polarization at the output, unless 
the difference between the two propagation constants is small enough 
so that 


(B2 -Bl«e, 


l being the waveguide length. In this section we derive B, and f;. 
Helical waveguides are of importance for their simplicity of construc- 
tion, as compared to corrugated waveguides, and for their excellent 
performance as hybrid-mode feeds, as shown recently by R. H. Turrin® 
whose work motivated the calculation of this section. The following 
results agree with Ref. 34. 

Let consideration be restricted to the fundamental modes of a 
lossless boundary with [H] independent of position on the boundary. 
Then, from eqs. (25) and (44) 


Tyg = Av xx - (Ae r Ax) vxy + A 22ryy, (75) 
le = (Ai; — A22) vy + Mv xx 7 Aa ryy, (76) 
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and similarly for J,, and I,, (interchange 1 = 2, x = y), where 
1 | af, |° 
Vik = ViVR 


tasty dl (j,k = x, y). (77) 
ov 
Since the boundary is assumed to be lossless, 
(Im) (Jy) ces Ai2( xx se Vyy)s (78) 


and, therefore, I, 4 0. We conclude that for Hi2 # 0 there is no 
degeneracy possible for the fundamental modes. For a circular bound- 
ary, using eqs. (74), (25), (50), (51), and (62), we obtain 





1 
ee (79) 
/2 y: 
and 
a 2(cos*@ — XY sin’@) 





(eaten Pry Pore colin) Oren We -7 
ka ka 


with the plus sign of eq. (79) corresponding to the minus sign of eq. 
(80). The same expressions apply also to a square aperture with a = b. 
Thus, in both cases y.. is circularly polarized. If XY + 1 > 0, then the 
lower value of o corresponds to a mode with polarization rotating in 
the sense of the helix in Fig. 4. The opposite is true for XY + 1 < 0. 


Vil. CONCLUSIONS 


To summarize, we have shown for most of the modes inside a 
cylindrical waveguide of finite surface impedances that asymptotically, 
for large values of ka, the field ¥ vanishes at the boundary. We have 
seen in Section III that for each eigenvalue uw, there are, in general, 2N 
modes, given for k > © by eq. (33). For the lowest eigenvalue one has 
N = 1, and for the corresponding two modes y.. is given by f(x, y)i, 
where i is a unit vector. If the direction cosines of i are complex, then 
wv. is elliptically polarized. If [H] is a diagonal matrix, as for the 
waveguides of Fig. 2, and the boundary has an axis of symmetry, then 
iis real, and one can always orient the x axis so that i = i,. In this case 
the propagation constant £ is given by eqs. (59) and (60), and using 
the procedure of Appendix A we can straightforwardly determine y, 
with error of order two in 1/k, for any boundary shape. For rectangular 
and circular boundaries y is given by eqs. (70) and (72). 

Of special importance are the fundamental modes, which correspond 
to the lowest eigenvalue o... These modes, treated in Section 3.2, are 
needed in reflector antennas to minimize cross-polarization and edge 
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illumination over the feed aperture. They are also needed for long 
distance waveguide or fiberguide communication. Our results, show 
that there is no need to corrugate the walls of a feed in order to obtain 
conditions (11) and (48) or to obtain the low attenuations calculated in 
Refs. 20 and 21. Furthermore, they imply that the low attenuations 
predicted in Ref. 2 for a hollow waveguide of dielectric, or for a 
waveguide with metal walls coated with dielectric,* can be achieved 
also using other waveguides. These applications are discussed in 
Ref. 1. 


APPENDIX A 
A.1 Derivation of eq. (23) 
Taking into account that V-E = 0, one can express EF, in terms on 
the transverse component given by eq. (2). One obtains 
pe shz 
JB 
Using this relation and Maxwell’s equation —jwuH = V X E one can 


express H, and H, in terms of y. Substituting these expressions in eq. 
(5), we obtain the boundary condition 


E. = 





Vy. 


k 1 a 
be = (Vip) —— (Vy) 
|-}un B + Be (81) 
Ww, ~V,-(iz x W) 0 


Taking into account eqs. (21) and (22) for k > , we have 
y— 0, on C. 
This implies 


(as. yest =. 
ov ov 








Taking into account these relations, from eq. (81) we obtain eq. (23) 
with error of order two in 1/k. 


A.2 Development of 0° and y in Asymptotic Series of 1/ka 


A general procedure for deriving the various terms c,, y, in eqs. (21) 
and (22) is now described, thus justifying these equations. Assume that 
the boundary has at least one line of symmetry, since this simplifies 
considerably the derivation, and it applies to most cases of practical 
interest. Also assume that [H] has the diagonal form of eq. (19), and 
that a single eigenfunction /,(x, y) corresponds to ow. 

Since there is no degeneracy, y.. is given by eq. (48). Let the x axis 
coincide with the symmetry line. Then, the surface parameters X and 
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Y are even functions of y, 
X(x,-y) =X(x,y) and = Y(x, —y) = Y(x, y), (82) 


and we now show the modes can be divided in two groups, namely 
even modes Satisfying 


Wx(x, —y) = x(x, y), Wx, ~y) = —Yy(x, y), (83) 
and odd modes satisfying 
Wx(x, —y) = —Yx(x,y¥), W(x, —y) = Wy(x, y). (84) 
In fact, let 


y ee x(x, y)ix + Wy (x, y)iy 
be a solution of the wave equation and of the boundary condition (81) 
with [H] given by eq. (19). Then we wish to show that 
y’ = Px (x, —y)ix mae Wy (x, —y)i, 


is also a solution. Notice that w’ is the image of with respect to the 
x axis, as shown in Fig. 5, where P and P’ denote two corresponding 
points (x, y) and (x, ~—y). One can verify that 


(V: xX W’)p = —(Vi X p)p, (85) 
(Vew’)p = (Va) p. (86) 


If P is a boundary point, and vp’ and pv denote the normals to the 
boundary at P’ and P, respectively, one has that pv’ is the image of v 
because the boundary is symmetrical. Taking all this into account one 
can verify that W’ satisfies the boundary condition (81) with [H] given 
by eq. (19) at P’. We conclude that if an arbitrary solution y is known, 
two independent solutions y, and y, can be obtained by the relations 


We= “+H  vo= 4ly—v'L 7) 


Fig. 5—For a symmetrical boundary, the boundary conditions are satisfied by both 
y and its mirror image y’. 
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One solution, ¥., is even and the other, i, is odd. 
We now proceed to derive o” and y. Let i = i, and assume f(x, y) is 
even, i.e., 


filx, y) = fi(x, —y), (88) 


all other cases (i = i,, or f, odd) being entirely analogous. Then y, and 
yy are, respectively, even and odd and it is convenient to separate G 
into three parts, 


G = G. + Gi + G,, (89) 
where G,, Gy denote, respectively, the even and odd parts of G — G...* 


Taking into account that y, is even, from eqs. (34), (36), and (89) one 
obtains 








0G, 
= Aifi(x, y) + | Wx dl, (90) 
- Ov 
where 
0 
a dl. (91) 
Similarly, for y,, 
0G 
Wy = - Wy — dl. (92) 
” Ov 


In eq. (90), since y can be multiplied by an arbitrary constant, the 
coefficient A; can be chosen arbitrarily. We choose A; = 1, to be’ 
consistent with y.. = fi(x, y)ix. Then eq. (91) gives 


o—o,= [Ban (93) 


a basic relation that expresses o* in terms of the boundary values of 
wy. Expanding y, in a power series of 1/ka, from eq. (91) one obtains 
for the ith coefficient of 0’, 


1 of 
Ci = o. [ Wri Oy: dl, (94) 
where 7; is the x component of y; in eq. (21). From eq. (66), 
0 
Wer = —(Xv¥ — Yvja a (95) 
ap’ 


* Thus G, is obtained from eq. (36) considering only those terms for which f, are odd, 
whereas for G,, consideration is restricted to f, even with r > 1. 
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and, therefore, for i = 1, eq. (94) gives eq. (59). 
From eqs. (90) and (92), taking into account that at the boundary 
dp=— po = ¥, 








0G 
Oy = | Ovx by, = i By G; al. (96) 
c 
Thus, using eq. (95), we obtain 
0 
Yana | be + von S (SE) (97) 
for the values of 1 inside the boundary whereas for y,1, using eq. 
(66), we obtain 
6] 
Wy = | VxVy(x — Y) — h (=) dl, (98) 
Cc ee 


where ( ). denotes the value for o = o... Equations (97) and (98) allow 
w and its derivatives to be determined with an error O(1/k). Therefore, 
the right-hand side of eq. (81), which contains the factor 1/k, can now 
be calculated with an error O(1/k?). The boundary values of yx2, Wye 
are then obtained from eq. (81). 

Once these values are known, eq. (96) can be used to determine yx2 
and yo inside the boundary. Equation (81) can then be used again to 
determine the boundary values of ¥x3 3, whose values inside the 
boundary can then be calculated using eq. (96). By proceeding this 
way, we can successively calculate all y:, Y,;. Notice in eq. (96) that 
the kernels depend on o’. Therefore, to determine wi, ¥,; for i > 1, we 
must first calculate the coefficients ci, ---, c;-1 using eq. (94). Once 
these coefficients are known, the kernels must be developed in power 
series of 1/k, and then the first 1 — 1 terms in these series must be 
determined. These terms then allow eq. (96) to be used to determine 


Wri, Wyi- 


APPENDIX B 


Let f, be one of the eigenfunctions satisfying the boundary condition 
f; = 0 and the wave equation 


Vif + uifi = (), 


and let uw; be the lowest eigenvalue. This implies that if g(x, y) is an 
arbitrary function with continuous derivatives, then 


i geVig dxdy < ui if g” dxdy, (99) 
s s 
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where the inequality sign applies to any g(x, y) that is not an eigen- 
function corresponding to the eigenvalue uw. 

Condition (99) is now used to show that f; cannot have nodal lines 
inside S. In fact, suppose f; has a nodal line inside S, and let 


g= fi. 


Then, since V;g is discontinuous across the nodal line, g cannot be an 
eigenfunction, and, therefore, condition (99) should give an inequality. 
To evaluate the integral 


T= | | gVig dxdy (100) 


in the immediate vicinity of the nodal line, where V7g diverges because 
of the discontinuity of V.g, write 


eVig = VileVeg] — (Veg)? 


and notice that 


EVvig 


is continuous because g = 0 on the nodal line. It follows that gV?g 
does not diverge on the nodal line, and, therefore, its integral over a 
narrow strip containing the nodal line vanishes as the width of the 
strip goes to zero. Thus, 


[= {I fiVi fi dxdy = uj {| fi dxdy (101) 
s s 


and, therefore, condition (99) gives an equality, which implies /; cannot 
have nodal lines inside S. 

It is now shown that f; is the only eigenfunction corresponding to 
Uy. In fact, if fe is another eigenfunction corresponding to uw, then this 
must be true also for 


f=fit afe, 


where a is an arbitrary constant. But this is not possible, since one can 
always choose a causing f to have a nodal line inside S, and we have 
already seen that this violates condition (99). 


APPENDIX C 


Consider the modes propagating between two parallel planes or- 
thogonal to the x axis and let 2a be the spacing of the two planes. 
Assume at the boundary one of the conditions of Fig. 2, and let the x 
axis be oriented in the direction of propagation, so that there is no 
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variation in the y direction. Then 
y= ¥(x)i, (102) 


where 1 is a unit vector. For the TM-modes, i = ix, whereas for the TE- 
modes i = i,. For the former case E, = H, = 0 and, therefore, y is 
independent of the transverse impedance Z,. In the latter case H, = E, 
= 0 and y is independent of Z,. In either case one finds’”’ that the 
surface impedances Z, and Z, in eq. (9) can be determined straightfor- 
wardly. For two metal plates with dielectric coating of thickness d, for 
instance, Z, and Z, are determined entirely by £, d, and the refractive 
index n of the dielectric, 


k 
X =—————— tan [ d Vk’n? - F) , (103) 
[h2n? — B? ( 
and 
2 
= eee ae SS , (104) 


ene tan (a Vk?n? — F) 


which for B — k give eqs. (17) and (18). Analogous expressions are 
obtained for the other boundaries in Fig. 2. 
Consider the even modes with i = i,, 


Wy = Yi, = cosoxi,, (105) 


where from eq. (81) the wavenumber o must satisfy” 


Y 1 


gees See 106 
ka oatanoa n06) 


whose behavior for real values of oa is illustrated in Fig. 6a. If Y # », 
then Y/ka vanishes for ka > and, therefore, for most of the solutions 
of eq. (106), 


oa—> ma ——, Nea DO .02, (107) 


as ka — o., In addition to these solutions, for Y < 0 one mode exists 
for which oa is imaginary and® 


k 
oa— -j as a— 0, (108) 


Equation (107) implies that the boundary values of ~ vanish for ka 
—» oo, whereas eq. (108) implies that the mode is a surface wave whose 
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Y/ka 





Fig. 6—Relationship between oa and Y/ka for the TM-modes of a parallel plate 
waveguide. Notice in the dielectric-lined waveguide the spacing 2a of the two plates 
increases with z. As a consequence, the field amplitude at the boundary decreases 
with a, for the mode with m = 1. 


amplitude in the vicinity of the walls is given by 


1 k k 
b= fewn( +t a] exo( Tle a i (109) 
2 Pty] ie 
Notice eq. (107) implies 
B= k, (110) 


as ka — oo, whereas for the surface wave 


1 
bok L+s3 7k. (111) 


To understand the significance of these results, consider two metal 
plates with dielectric coating as in Fig. la. Let the separation a of the 
two plates be gradually increased in the direction of propagation, as 
shown in Fig. 6, and let a > » as z — o., Let the dielectric thickness 
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d be so small that initially, for z = 0, 


Y 
hie (112) 


Then the modes for z = 0 are essentially those of an ordinary waveguide 
without dielectric coating, 


oa=mn (m = 0, 1, 2, ---). (113) 


For z > 0, however, the magnitude of Y/ka decreases with z, and 
Y/ka — 0 for z — o. This implies, for all the modes with m ¥ 0, 
conditions (107) and (110) and, therefore, the boundary values of 
vanish for z —> «©. For m = 0, on the other hand, oa is imaginary and 
initially ca = 0. This mode will degenerate for z — © into a surface 
wave with propagation constant determined by Y as shown by eq. 
(111). This is the only mode for which the field does not become 
infinitesimal at the walls for z — o. For all the other modes the 
boundary values of y¥ for large ka are given by 


e yy se, (114) 


which vanishes for ka > ~. 

The above considerations apply also to the TE-modes. In fact, it can 
be verified that if in eq. (2) E, is replaced with ZH; so that y represents 
the transverse component of ZH;, then, for the even modes, y is still 
given by eqs. (105) and (106), provided Y is replaced with X. The 
behavior of the odd modes is entirely analogous; simply replace 
cos ox with sin ox in eq. (105), and tan with — cotan in eq. (106). 
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